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RECOMBIN ANT VESICULOVIRUSES AND THEIR USES 

This invention was made with government support 
under grant number R37 AI243245 awarded by the National 
5 Institutes of Health. The government has certain rights in 
the invention. 

1. INTRODUCTION 
The present invention relates to recombinant 
10 vesiculoviruses which are replicable and capable of 

expressing foreign nucleic acid contained in their genome. 
Also provided are inactivated forms of the recombinant 
viruses. The vesiculoviruses are useful in vaccine 
formulations to prevent or treat various diseases and 
15 disorders. 

2. BACKGROUND OF THE INVENTION 
2.1. RHABDOVIRUSES 

♦ 

^. Bhabdoviruses are membrane-enveloped viruses that 

2 0 are widely distributed in nature where they infect 

vertebrates, invertebrates, and plants. There are two 
distinct genera within the. rhabdoviruses, the Lyssavirus 
genus and the Vesiculovirus genus. Rhabdoviruses have 
single, negative-strand RNA genomes of 11-12,000 nucleotides 

25 (Rose and Schubert, 1987, Rhabdovirus genomes and their 
products, in The Viruses: The Rhabdoviruses , Plenum 
Publishing Corp., NY, pp. 129-166). The virus particles 
contain a helical, nucleocapsid core composed of the genomic 
RNA and protein. Generally, three proteins, termed N 

30 (nucleocapsid), P (formerly termed NS, originally indicating 
nonstructural), and L (large) are found to be associated with 
the nucleocapsid. An additional matrix (M) protein lies 
within the membrane envelope, perhaps interacting both with 
the membrane and the nucleocapsid core. A single 

35 glycoprotein (G) species spans the membrane and forms the 
spikes on the surface of the virus particle. Because the 
genome is the negative sense [i.e., complementary to the RNA 
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sequence (positive sense) that functions as mRNA to directly 
produce encoded protein], rhabdoviruses must encode and 
package an RNA-dependent RNA polymerase in the virion 
(Baltimore et al., 1970, Proc. Natl. Acad. Sci. USA 66: 
S 572-576), composed of the P and L proteins. This enzyme 
transcribes genomic RNA to make subgenomic MRNAS encoding the 
5-6 viral proteins and also replicates full-length positive 
and negative sense RNAs. The genes are transcribed 
sequentially, starting at the 3' end of the genomes. The 
10 same basic genetic system is also employed by the 
paramyxoviruses and filoviruses. 

The prototype rhabdovirus, vesicular stomatitis 
virus (VSV) , grows to very high titers in most animal cells 
and can be prepared in large quantities. As a result, VSV 
15 has been widely used as a model system for studying the 

replication and assembly of enveloped RNA viruses. The study 
of VSV and related negative strand viruses has been limited 
by the inability to perform direct genetic manipulation of 
the virus using recombinant DNA tech,.clc gi . Vne difficulty 
20 in generating VSV from DNA is that neither the full-length 
genomic nor antigenomic RNAs are infectious. The minimal 
infectious unit is the genomic RNA tightly bound to 1,250 
subunits of the nucleocapsid (N) protein (Thomas et al., 
1985, J. Virol. 54:598-607) and smaller amounts of the two 
25 virally encoded polymerase subunits, L and P. To 

reconstitute infectious virus from the viral RNA, it is 
necessary first to assemble the N protein-RNA complex that 
serves as the template for transcription and replication by * 
the VSV polymerase. Although smaller negative-strand RNA 
30 segments of the influenza virus genome can be packaged into 
nucleocapsids in vitro, and then rescued in influenza 
infected cells (Enami et al., 1990, Proc. Natl. Acad. Sci. 
USA 87:3802-3805; Luytjes et al., 1989, Cell 59:1107-1113), 
systems for packaging the much larger rhabdoviral genomic ' 
35 RNAs in vitro are not yet available. 

Recently, systems for replication and transcription 
of DNA-derived minigenomes or small defective RNAs from 
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rhabdoviruses (Conzelmann and Schnell, 1994, J. virol. 
68:713-719; Pattnaik et al., 1992, Cell 69:1011-1120) and 
paramyxoviruses (Calain et al., 1992, Virology 191:62-71; 
Collins et al., 1991, Proc. Natl. Acad. Sci. USA 
5 88:9663-9667; Collins et al., 1993, Virology 195:252-256; De 
and Banerjee, 1993, Virology 196:344-348; Dimock and Collins, 
1993, J. Virol. 67:2772-2778; Park et al., 1991, Pro c. Natl. 
Acad. Sci. USA 88:5537-5541) have been described. in these 
systems, RNAs are assembled into nucleocapsids within cells 
10 that express the viral N protein and polymerase proteins. 
Although these systems have been very useful, they do not 
allow genetic manipulation of the full-length genome of 
infectious viruses. 

The recovery of rabies virus from a complete cDNA 
15 clone was published recently (Schnell et al. , 1994, EMBO J. 
" 13:4195-4203). The infectious cycle was initiated by - 
expressing the antigenomic (full-length positive strand) RNA 
in cells expressing the viral N, p, and L proteins. Although 
rabies -vi.xwer.As a rhabdovirus, it is structurally and 
20 functionally different from the vesiculoviruses. Rabies 
virus is a Lyssavirus, not a Vesiculovirus. Lyssaviruses 
invade the central nervous system. Vesiculoviruses invade 
epithelial cells, predominantly those of the tongue, to 
produce vesicles. Rabies virus causes encephalitis in a 

25 variety of animals and in humans, while VSV causes an 
epidemic but self-limiting disease in cattle. In sharp 
contrast to VSV-infected cells, rabies virus produces little 
or no cytopathic effect in infected cell culture, replicates 
less efficiently than VSV in cell culture, and causes little 

30 depression of cellular DNA, RNA or protein synthesis in 
infected cell cultures (see Baer et al., 1990, in Virology, 
2d ed., Fields et al. (eds.), Raven Press, Ltd., NY, pp. 883, 
887). indeed, there is no cross-hybridization observed 
between the genomes of rabies virus and VSV, and sequence' 

35 homology between the two genomes is generally discernable 
only with the aid of computer run homology programs. The 
differences between vesiculoviruses and rabies virus, and the 
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extremely rare nature of rabies virus recovery from cDNA (-10 
cells are transfected to yield one infectious cell) , renders 
it unpredictable whether the strategy used with rabies virus 
would be successful for viruses of a different genus, i.e., 
5 the vesiculoviruses. 

The recovery of infectious measles virus, another 
negative strand RNA virus, from cloned cDNA has been 
attempted, without- success (see Ballart et al., 1990, EMBO J. 
9(2):379-384 and the retraction thereof by Eschle et al., 
10 1991, EMBO J. 10(11) :3558) . 

2.2. VACCINES 

The. development of vaccines for the prevention of 
viral, bacterial, or parasitic diseases is the focus of much 
IS research effort. 

Traditional ways of preparing vaccines include the 
use of inactivated or attenuated pathogens. A suitable 
inactivation of the pathogenic microorganism renders it 
harmless as a biological agent but does no u- destroy its 

20 immunogenicity . Injection of these "killed- particles into a 
host will then elicit an immune response capable of 
preventing a future infection with a live microorganism. 
However, a major concern in the use of killed vaccines (using 
inactivated pathogen) is failure to inactivate all the 

25 microorganism particles. Even when this is accomplished, 
since killed pathogens do not multiply in their host, or'for 
other unknown reasons, the immunity achieved is often 
incomplete, short lived and reguires multiple immunizations. . 
Finally, the inactivation process may alter the 

3 0 microorganism's antigens, rendering them less effective as 
immunogens. 

Attenuation refers to the production of strains of 
pathogenic microorganisms which have essentially lost their 
disease-producing ability. One way to accomplish this is to 
35 subject the microorganism to unusual growth conditions and/or 
frequent passage in cell culture. Mutants are then selected 
which have lost virulence but yet are capable of eliciting an 
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immune response. Attenuated pathogens often make good 
immunogens as they actually replicate in the host cell and 
elicit long lasting immunity. However, several problems are 
encountered with the use of live vaccines, the most worrisome 
5 being insufficient attenuation and ,the risk of reversion to 
virulence. 

An alternative to the above methods is the use of 
subunit vaccines. This involves immunization only with those 
components which contain the relevant immunological material . 
10 Vaccines are often formulated and inoculated with 

various adjuvants. The adjuvants aid in attaining a more 
durable and higher level of immunity using small amounts of 
antigen or fewer doses than if the immunogen were 
administered alone. The mechanism of adjuvant action is 
15 complex and not completely understood. However, it may 

involve the stimulation of cytokine production, phagocytosis 
and other activities of the reticuloendothelial system as 
well as a delayed release and degradation of the antigen. 
Examples oi ■ adjusts include Freund's adjuvant (complete or 
20 incomplete), Adjuvant 65 (containing peanut oil, mannide 
monooleate and aluminum monostearate) , the pluronic polyol 
L-121, Avridine, and mineral gels such as aluminum hydroxide, 
aluminum phosphate, etc. Freund's adjuvant is no longer used 
in vaccine formulations for humans because it contains 
25 nonmetabolizable mineral oil and is a potential carcinogen. 

3 . SUMMARY OF THE TMVFNTTnM 
The present invention provides recombinant 
replicable vesiculoviruses. The prior art has unsuccessfully 

30 attempted to produce replicable vesiculoviruses from cloned 
DNA. In contrast, the invention provides a method which, for 
the first time, has successfully allowed the production and 
recovery of replicable vesiculoviruses, as well as 
recombinant replicable vesiculoviruses, from cloned DNA. 

35 The vesiculoviruses of the invention are produced 

by providing in an appropriate host cell: (a) DNA that can 
be transcribed to yield (encodes) vesiculovirus antigenomic 
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(+) RNA (complementary to the vesiculovirus genome) , (b) a 
recombinant source of vesiculovirus N protein, (c) a 
recombinant source of vesiculovirus P protein, and (d) a 
recombinant source of vesiculovirus L protein; under 

5 conditions such that the DNA is transcribed to produce the 
antigenomic RNA, and a vesiculovirus is produced that 

■ contains genomic RNA complementary to the antigenomic RNA 
produced from the DNA. 

The invention provides an infectious recombinant 
10 vesiculovirus capable of replication in an animal into which 
the recombinant vesiculovirus is introduced, in which the 
genome of the vesiculovirus comprises foreign RNA which is 
not naturally a part of the vesiculovirus genome. The 
recombinant vesiculovirus is formed by producing 
15 vesiculoviruses according to the method of the invention, ' in 
which regions of the DNA encoding, vesiculovirus antigenomic 
(+) RNA that are nonessential for viral replication have been 
inserted into or replaced with foreign DNA. 

In a preferred embodiment, the ioici^i ^ 
20 contained within the genome of the recombinant vesiculovirus 
(originally encoded by the foreign DNA) , upon expression in 
an appropriate host cell, produces a protein or peptide that 
is antigenic or immunogenic. 

The recombinant vesiculoviruses of the invention 
25 have use as vaccines. In one embodiment, where the foreign 
RNA | directs production of an antigen that induces an immune 
response against a pathogen, the vaccines of the invention 
have use in the treatment or prevention of infections by such 
a pathogen (particularly a pathogenic microorganism) , and its 
30 clinical manifestations, i.e., infectious disease. in a 
preferred embodiment, such an antigen displays the 
antigenicity or immunogenicity of an envelope glycoprotein of 
a virus other than a vesiculovirus, and the antigen is 
incorporated into the vesiculovirus envelope. The 
35 recombinant vesiculoviruses also have uses in diagnosis, and 
monitoring progression of infectious disorders, including 
response to vaccination and/ or therapy. 



- 6 - 



I 

■ BNSDOCID: <WO 9634625A1_I_> 



t 



WO 9604625 PCT/US96/06053 



In another embodiment, where the foreign RNA 
directs production of an antigen that induces an immune 
response against a tumor, the recombinant viruses of the 
invention have uses in cancer immunoprophylaxis, 
5 immunotherapy, and diagnosis, and monitoring of tumor 
progression or regression. 

The recombinant vesiculoviruses can be used as live 
vaccines, or can be inactivated for use as killed vaccines. 
The recombinant viruses can also be used to produce large 
10 quantities of readily purified antigen, e.g., for use in 
subunit vaccines. 

The invention also provides vaccine formulations, 
kits, and recombinant host cells. 

15 4 - DESCRIPTION OF THE FIGURES 

Figure l. Nucleotide sequence of plasmid 
pVSVFL(+) , showing the complete DNA sequence that is 
transcribed to produce VSV antigenomic (+) RNA, and predicted 
sequences of the Woflcd VSV proteins. [N protein: 
20 SEQ ID NO: 2; P protein: SEQ ID NO: 3; M protein: SEQ ID 

NO : 4 ; G protein: SEQ ID N0:5; L protein: SEQ ID NO:6] The 
noncoding and intergenic regions are observable. The upper 
line of sequence (SEQ ID NO:i) is the VSV antigenomic 
positive strand; lower line = SEQ ID NO: 7. Restriction sites 
25 are indicated. The transmembrane and cytoplasmic domains of 
the G protein are also indicated. The sequences of the first 
T7 RNA polymerase promoter (SEQ ID NO: 8), the second T7 RNA 
polymerase promoter (SEQ ID NO: 9); leader sequence (SEQ ID 
NO:l0), T7 RNA polymerase transcription termination signal 
30 (SEQ ID NO:ll), and the sequence that is transcribed to 
produce the HDV ribozyme (SEQ ID NO: 12) are shown. 

Figure 2. Nucleotide sequence of a portion of 
plasmid pVSVSSl, showing the synthetic DNA insert containing 
the polyl inker region inserted between the G and L coding 
35 regions (3' of G and 5' of L) containing unique restriction 
enzyme recognition sites, namely, Xmal, NotI, and Smal. 
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Upper line of sequence: SEQ ID NO: 13; lower line of 
sequence: SEQ ID NO: 14. 

Figure 3. Gene junctions of VSV. The nucleotide 
sequences at the 3 ' end of the leader RNA and the 5 ' and 3' 
5 ends of each mRNA are shown along with the corresponding 
genomic sequences (vRNA) (SEQ ID NO:15-31). The intergenic 
dinucleotides are indicated by bold letters. From Rose and 
Schubert, 1987, in The Viruses: The Rhabdoviruses , Plenum 
Press, NY, pp. 129-166, at p. 136. 
10 Figure 4. Plasmid DNA construction. A. The 

diagram illustrates the cloned VSV genomic sequence and the 
four DNA fragments (numbered 1-4) that were used to generate 
the plasmid pVSVFL(+). The horizontal arrows represent PCR 
primers used to generate fragments 1 and 3. B.. Diagram of 
15 the plasmid pVSVFL( + ) that gives rise to infectious VSV. 

The locations of the VSV genes encoding the five proteins N, 
P, M, G, and L are shown. The stippled region from Sac I to 
Xho I represents the.pBSSK+ vector sequence, and the hatched 
segments represent the regions of the VSV genome generated by 
2 0 PCR. Transcription from the T7 promoter generates the 
complete (+) strand VSV RNA. 

Figure 5. Proteins present in wild-type and 
recombinant VSVs. Proteins from 1% of the virus recovered 
from approximately 5 x 10 6 infected BHK cells were separated 
25 by SDS-PAGE (10% acrylamide) and visualized by staining with 
Coomassie brilliant blue. Positions of the five VSV proteins 
are indicated. 

Figure 6. Identification of a restriction enzyme 
recognition sequence in the recombinant VSV. A 620 

30 nucleotide segment of genomic RNA isolated from wildtype and 
recombinant VSV was amplified by reverse transcription and 
PCR using the primers 5 7 -CATTCAAGACGCTGCTTCGCAACTTCC 
(SEQ ID NO: 32) and 5 ' -CATGAATGTTAACATCTCAAGA (SEQ ID NO.: 3 3). 
Controls in which reverse transcriptase was omitted from the 

35 reaction are indicated. DNA samples were either digested 
with Nhe I or left undigested prior to electrophoresis on a 
6% polyacrylamide gel as indicated. DNA was detected by 
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staining with ethidium bromide. Sizes of DNA markers are 
indicated on the left. 

Figure 7. Autoradiogram showing the sequence of 
genomic RNA from recombinant VSV. RNA prepared from 
5 recombinant VSV was sequenced by the dideoxy method using 
reverse transcriptase. The written sequence corresponds to 
nucleotides 1563-1593 in the G mRNA (Rose and Gallione, 1981, 
J. Virol. 39:519-528). The underlined sequence represents 
the four nucleotides that were changed to generate the Nhe I 
10 site. 

Figure 8. Protein analysis of recombinant VSV 
expressing the glycoprotein from the New Jersey serotype. 
Proteins from 1% of the virus pelleted from the medium of 
approximately 5 x 10 6 BHK cells infected for 24 hours with 
15 wildtype VSV, (lane 1), recombinant VSV^ JC (lane 2) or wildtype 
VSV NJ (lane 3) were separated by SDS-PAGE (10% acrylamide) . 
The proteins were visualized by staining with Coomassie 
brilliant blue. Positions of viral proteins are indicated. 

20 5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides recombinant 
replicable vesiculoviruses. The prior art has unsuccessfully 
attempted to produce replicable vesiculoviruses from cloned 
DNA. In contrast, the invention provides a method which, for 

25 the first time, has successfully allowed the production and 
recovery of replicable vesiculoviruses, as well as 
recombinant replicable vesiculoviruses, from cloned DNA. 
Expression of the full-length positive-strand vesiculovirus 
RNA in host cells has successfully allowed the generation of 

30 recombinant vesiculoviruses from DNA, providing recombinant 
viruses that do not cause serious pathology in humans and 
that can be obtained in high titers, that have use as 
vaccines. 

The vesiculoviruses of the invention are produced 
35 by providing in an appropriate host cell: (a) DNA that can 
be transcribed to yield (encodes) vesiculovirus antigenomic 
(+) RNA (complementary to the vesiculovirus genome) , (b) a 
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recombinant source of vesiculovirus N protein, (c) a 
recombinant source of vesiculovirus P protein, and (d) a 
recombinant source of vesiculovirus L protein; under 
conditions such that the DNA is transcribed to produce the 
5 antigenomic RNA, and a vesiculovirus is produced that 
contains genomic RNA complementary to the antigenomic RNA 
produced from the DNA. 

The invention provides an infectious recombinant 
vesiculovirus capable of replication in an animal into which 

10 the recombinant vesiculovirus is introduced, in which the 
genome of the vesiculovirus comprises foreign RNA which is 
not naturally a part of the vesiculovirus genome. The 
recombinant vesiculovirus is formed by producing 
vesiculoviruses according to the method of the invention, in 

15 which regions of the DNA encoding vesiculovirus antigenomic 
( + ) RNA that are nonessential for viral replication have been 
inserted into or replaced with foreign DNA. 

Since the viruses are replicable (i.e., not 
replication-defective}; th^y encode all the vesiculovirus 

2 0 machinery necessary for replication in a cell upon infection 
by the virus. 

In a preferred embodiment, the recombinant 

vesiculovirus is a recombinant vesicular stomatitis virus 
(VSV) . 

25 In another preferred embodiment, the foreign RNA 

contained within the genome of the recombinant vesiculovirus 
(originally encoded by the foreign DNA), upon expression in 
an appropriate host cell, produces a protein or peptide that 
is antigenic or immunogenic. Such an antigenic or 

30 immunogenic protein or peptide whose expression is directed 
by the foreign RNA (present in the negative sense) within the 
vesiculovirus genome (by expression from the ( + ) antigenomic 
message) shall be referred to hereinafter as the "Antigen." 
Appropriate Antigens include but are not limited to known 

35 antigens of pathogenic microorganisms or of tumors, as well 
as fragments or derivatives of such antigens displaying the 
. antigenicity or immunogenicity of such antigens. A protein 
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displays the antigenicity of an antigen when the protein is 
capable of being immunospecif ically bound by an antibody to 
the antigen, A protein displays the immunogenicity of an 
antigen when it elicits an immune response to the antigen 
5 (e.g., when immunization with the protein elicits production 
of an antibody that immunospecif ically binds the antigen or 
elicits a cell-mediated immune response directed against the 
antigen) . 

The recombinant vesiculoviruses of the invention 

10 have use as vaccines. In one embodiment, where the foreign 
RNA directs production of an Antigen (originally encoded by 
the foreign DNA used to produce the recombinant vesiculovirus 
or its predecessor) that induces an immune response against a 
pathogen, the vaccines of the invention have use in the 

15 treatment or prevention of infections by such a pathogen 
(particularly a pathogenic microorganism) , and its clinical 
manifestations, i.e., infectious disease. The invention thus 
provides methods of prevention or treatment of infection and 
infectious disease comprising administering to a subject- in 

20 which such treatment or prevention is desired one or more of 
the recombinant vesiculoviruses of the invention. The 
recombinant vesiculoviruses also have uses in diagnosis, and 
monitoring progression of infectious disorders, including 
response to vaccination and/or therapy. 

25 In another embodiment, where the Antigen induces an 

immune response against a tumor, the recombinant viruses of 
the invention have uses in cancer immunoprophylaxis, 
immunotherapy, and diagnosis, and monitoring of tumor 
progression or regression. 

30 The recombinant vesiculoviruses can be used as live 

vaccines, or can be inactivated for use as killed vaccines. 
The recombinant viruses can also be used to produce large 
quantities of readily purified antigen, e.g., for use in 
subunit vaccines. 

35 In a specific embodiment, the foreign DNA used 

initially for production of the recombinant vesiculoviruses 
can also comprise a sequence encoding a detectable marker, 
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e*g. t 0-galactosidase, 0 -glucuronidase, 0-geo (Friedrich & 
Soriano, 1991, Genes Dev. 5:1513-1523). 

In another specific embodiment, the foreign DNA can 
also comprise a sequence encoding a cytokine capable of 
5 stimulating an immune response. Such cytokines include but 
are not limited to, interleukin-2 , inter leukin-6, 
interleukin-12 , interferons, and granulocyte-macrophage 
colony stimulating factors. 

In a preferred aspect, upon infection with a 
10 recombinant vesiculovirus of the invention, the Antigen is 
expressed as a nonfusion protein. In a less preferred 
embodiment, the Antigen is expressed as a fusion protein, 
e.g., to the viral G protein. "Fusion protein," as used 
herein, refers to a protein comprising an amino acid sequence 
15 from a first protein covalently linked via a peptide bond at 
its carboxy terminus to the amino terminus of an amino acid 
sequence from a second, different protein. 

In one embodiment, a vaccine formulation of the 
invention contains a -ii:?\*;'type of recombinant vesiculovirus 
20 of the invention. In another embodiment, a vaccine 

formulation comprises a mixture of two or more recombinant 
viruses of the invention. 

The vaccine formulations of the invention provide 
one or more of the following benefits: stability for long 
25 periods without refrigeration; ease of production; low cost 
and high titer of production; ability to be administered by 
local workers without advanced medical training; and 
involving administration of a microorganism that ' is known not 
to cause serious disease in humans. 
30 The present invention also provides a host cell 

infected with a recombinant vesiculovirus capable of 
replication. In one embodiment, the host cell is a mammalian 
cell. Preferably, the mammalian cell is a hamster kidney 
cell. 1 • 

35 

t 
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5.1. DNA THAT CAN BE TRANSCRIBED TO PRODUCE 
VESICULOVIRUS ANTIGENOMIC ( + ) RNA 

Many vesiculoviruses are known in the art: and can 
be made recombinant according to the methods of the 
5 invention. Examples of such vesiculoviruses are listed in 
Table I. 



10 



Virus 



VSV-New Jersey 



TABLE I 

MEMBERS OF THE VESICULOVIRUS GENUS 
Source of virus in nature 



Mammals, mosquitoes, midges, 
blackf lies , housef lies 





VSV-Indiana 


Mammals, mosquitoes, sandflies 


15 


Alagoas 


Mammals, sandflies 




Cocal 


Mammals, mosquitoes, mites 




Jurona 


Mosquitoes 




Carajas 


Sandflies 


20 








Maraba 


Sandflies 




Piry 


Mammals 




Calchagui 


Mosquitoes 


25 


Yug Bogdanovac 


Sandflies 




Isfahan 


Sandflies, ticks 




Chandipura 


Mammals , sandflies 




Perinct 


Mosquitoes, sandflies 


30 


Porton-S 


Mosquitoes 



35 



Any DNA that can be transcribed to produce 
vesiculovirus antigenomic ( + ) RNA (complementary to the VSV 
genome) can be used for the construction of a recombinant DNA 
containing foreign DNA encoding an Antigen, for use in 
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producing the recombinant vesiculoviruses of the invention, 
DNA that can be transcribed to produce vesiculovirus 
antigenomic ( + ) RNA (such DNA being referred to herein as 
"vesiculovirus (-) DNA") is available in the art and/or can 
5 be obtained by , standard methods. In particular, plasmid 
pVSVFL(+), containing VSV (-) DNA that is preferred for use 
in the present invention, has been deposited with the ATCC 
and assigned accession no. 97134. In a preferred aspect, DNA 
that can be transcribed to produce VSV (+) RNA, [i.e., 

10 VSV (-) DNA], is used. VSV (-) DNA for any serotype or 
strain known in the art, e.g., the New Jersey or Indiana 
serotypes of VSV, can be used. The complete nucleotide and 
deduced protein sequence of the VSV genome is known, and is 
available as Genbank VSVCG, Accession No. J02428; NCBI Seg ID 

15 3 35873; and is published in Rose and Schubert, 1987, in The 
Viruses: The Rhabdoviruses , Plenum Press, NY, pp. 12 9-166. 
Partial sequences of other vesiculovirus genomes have been 
published and are available in the art. The complete 
ziiquence of the VSV(-) DLIA ±z used in a preferred 

20 embodiment is contained in plasmid pVSVFL(+) and is shown in 
Figure 1; also shown are with the predicted sequences of the 
VSV proteins (this sequence contains several sequence 
corrections relative to that obtainable from Genbank) . 
Vesiculovirus (-) DNA, if not already available, can be 

25 prepared by standard methods, as follows: If vesiculoviral 
cDNA is not already available, vesiculovirus genomic RNA can 
be purified from virus preparations, and reverse 
transcription with long distance polymerase chain reaction 
used to generate the vesiculovirus (-) DNA. Alternatively, 

30 after purification of genomic RNA, VSV mRNA can be 

synthesized in vitro, and cDNA prepared by standard methods, 
followed by insertion into cloning vectors (see, e.g., Rose 
and Gallione, 1981, J. Virol. 39 (2 ): 519-528) . Individual 
cDNA clones of vesiculovirus RNA can be joined by use of 

35 small DNA fragments covering the gene junctions, generated by 
use of reverse transcription and polymerase chain reaction 
(RT-PCR) (Mullis and Faloona, 1987, Meth. Enzymol. 
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155:335-350) from VSV genomic RNA (see Section 6, infra). 
Vesiculoviruses are available in the art. For example, VSV 
can be obtained from the American Type Culture Collection. 

In a preferred embodiment, one or more, preferably 
5 unique, restriction sites (e.g., in a polylinker) are 
introduced into the vesiculovirus (-) DNA, in intergenic 
regions, or 5' of the sequence complementary to the 3' end of 
the vesiculovirus genome, or 3' of the sequence complementary 
to the 5' end of the vesiculovirus genome, to facilitate 
10 insertion of the foreign DNA. 

In a preferred method of the invention, the 
vesiculovirus (-) DNA is constructed so as to have a promoter 
operatively linked thereto. The promoter should be capable 
of initiating transcription of the (-) DNA in an animal or 
15 insect cell in which it is desired to produce the recombinant 
vesiculovirus. Promoters which may be used include, but are 
not limited to, the SV40 early promoter region (Bernoist and 
Chambon, 1981, Nature 290 : 304-310) , the promoter contained in 
the 3 5 long terminal rspcaL of Rous sarcoma virus (YamaiLoio . 
20 et al., 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 
78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 
296:39-42); heat shock promoters (e.g., hsp70 for use in 
25 Drosophila S2 cells) ; the ADC (alcohol dehydrogenase) 

promoter, PGK (phosphoglycerol kinase) promoter, alkaline 
phosphatase promoter, and the following animal 
transcriptional control regions, which exhibit tissue 
specificity and have been utilized in transgenic animals: 
30 elastase I gene control region which is active in pancreatic 
acinar cells (Swift et al., 1984, Cell 38:639-646; Ornitz et 
al. , 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7 : 425-515) ; insulin gene control 
region which is active in pancreatic beta cells (Hanahan, 
35 1985, Nature 315:115-122), immunoglobulin gene control region 
which is active in lymphoid cells (Grosschedl et al., 1984, 
Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; 
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Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), mouse 
mammary tumor virus control region which is active in 
testicular, breast, lymphoid and mast cells (Leder et al., 
1986, Cell 45:485-495), albumin gene control region which is 
5 active in liver (Pinkert et al., 1987, Genes and Devel. 
1:268-276), alpha-f etoprotein gene control region which is 
active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 
5:1639-1648; Hammer et al. , 1987, Science 235:53-58 ; alpha 1- 
antitrypsin gene control region which is active in the liver 
10 (Kelsey et al., 1987, Genes and Devel. 1:161-171), beta- 
globin gene control region which is active in myeloid cells 
(Mogram et al., 1985, Nature 315:338-340; Kollias et al., 
.1986, Cell 46:89-94; myelin basic protein gene control region 
which is active in oligodendrocyte cells in the brain 
15 (Readhead et al. , 1987, Cell 48 : 703-712) ; and myosin light 
chain-2 gene control region which is active in skeletal 
muscle (Sani, 1985, Nature 314:283-286). Preferably, the 
promoter is an RNA polymerase promoter, preferably a 
ba^L^riophage or viral or ir.sfcfft.--BW . polymerase promoter, 
20 including but not limited to the promoters for T7 RNA 

polymerase, SP6 RNA polymerase, and T3 RNA polymerase. If an 
RNA polymerase promoter is used in which the RNA polymerase 
is not endogenously produced by the host cell in which it is 
desired to produce the recombinant vesiculovirus, a 
25 recombinant source of the RNA polymerase must also be 
provided in the host cell. 

The vesiculovirus (-) DNA can be operably linked to 
a promoter before or after insertion of foreign DNA encoding 
an Antigen. Preferably, a transcriptional terminator is 

30 situated downstream of the vesiculovirus (-) DNA. 

In another preferred embodiment, a DNA sequence 
that can be transcribed to produce a ribozyme sequence is 
situated at the immediate 3' end of the vesiculovirus (-) 
DNA, prior to the transcriptional termination signal, so that 

35 upon transcription a self-cleaving ribozyme sequence is 

produced at the 3' end of the antigenomic RNA, which ribozyme 
sequence will autolytically cleave (after a U) this fusion 
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transcript to release the exact 3' end of the vesiculovirus 
antigenomic (+) RNA. Any ribozyme sequence known in the art 
may be used, as long as the correct sequence is recognized 
and cleaved. (It is noted that hammerhead ribozyme is 
5 probably not suitable for use.) In a preferred aspect, 
hepatitis delta virus (HDV) ribozyme is used (Perrotta and 
Been, 1991, Nature 350:434-436; Pattnaik et al., 1992, Cell 
69:1011-1020). 

A preferred VSV(-) DNA for use, for insertion of 
10 foreign DNA, is that shown in Fig. 1 and contained in plasmid 
pVSVFL(+), in which a T7 RNA polymerase promoter is present 
5' of the sequence complementary to the 3' end of the VSV 
genome. Plasmid pVSVFL(+) thus comprises (in 5' to 3' order) 
the following operably linked components: the T7 RNA 
15 polymerase promoter, VSV (-) DNA, a DNA sequence that is 

transcribed to produce an HDV ribozyme sequence (immediately 
downstream of the VSV (-) DNA) , and a T7 RNA polymerase 
transcription termination site. A plasmid that can also be 
made and used is plasmiu pVSVSSI, a portion of the sequence 
20 of which is shown in Fig. 2, in which a synthetic DNA 

polylinker, facilitating insertion of foreign DNA, has been 
inserted into pVSVFL(+) between the G and L coding regions. 
The polylinker was synthesized on a DNA synthesizer so as to 
have ends compatible for ligation into an Nhel site, and to 
25 contain the unique restriction enzyme recognition sites Xmal, 
Smal, and NotI, facilitating insertion of foreign DNA 
generated by cleavage with, one of these enzymes or ligated to 
a linker containing a recognition site for one of these 
enzymes (which is then subjected to cleavage prior to 
30 insertion) . 

The foreign DNA encoding an Antigen is inserted 
into any region, or replaces any region, of the 
vesiculovirus (-) DNA that is not essential for vesiculovirus 
replication. In a preferred embodiment > the foreign DNA is 
35 thus inserted into an intergenic region, or a portion of the 
vesiculovirus (-) DNA that is transcribed to form the 
noncoding region of a viral mRNA. In a preferred embodiment, 
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the invention provides a nucleic acid comprising the DNA 
sequence of plasmid pVSVFL(+) as depicted in Figure 1 from 
nucleotide numbers 623-12088 (a portion of SEQ ID NO:l), in 
which a region nonessential for vesiculovirus replication has 
5 been inserted into or replaced by foreign DNA. 

Vesiculoviruses have a defined intergenic 
structure. Extensive homologies are found around the 
intergenic dinucleotides (Fig. 3). These regions have the 
common structure ( 3 ' ) AUACUUUUUuTJT^UUGUCNNUAG ( 5 ' ) 

10 (SEQ ID NO.-34), in which N indicates any nucleotide (thus 
three variable positions are present) and the intergenic 
dinucleotide is underlined. These dinucleotide spacers are 
GA, except at the NS-M junction, where the dinucleotide is 
CA. The first 11 nucleotides of the common sequence are 

15 complementary to the sequence (5') . . . UAUGAAAAAAA . . . 
(3') (SEQ ID NO: 35) that occurs at the 

mRNA-polyadenylate[poly(A)] junction in each mRNA including 
L. Reiterative copying of the U residues by the VSV 
Zz,lyjL*LC*e presumably generates ±h«; poly ;A) tail on each mRNA 

20 (McGeoch, 1979, Cell 17:3199; Rose, 1980, Cell 19:415; 
Schubert et al., 1980, J. Virol. 34:550). The sequence 
complementary, to the 5' end of the mRNA follows the 
intergenic dinucleotide. The L mRNA also terminates with the 
sequence UAUG-poly(A) encoded by the sequence ( 3 ' ) AUACUUUUUUU 

25 (SEQ ID NO: 36) and is presumably also polyadenylated by a 
polymerase "slippage" mechanism (Schubert et al., 1980, J. 
Virol. 34:550; Schubert and Lazarini, 1981, J. Virol. 
38:256). 

Thus, intergenic regions in vesiculovirus (-) DNA 
30 consist of three parts, triggering transcriptional 

termination and reinitiation present both 5' and 3' to each 
gene (presented as the 5 ' to 3' sequence of the positive 
sense strand of vesiculovirus (-) DNA): (a) TATGAAAAAAA 
(SEQ ID N0:37), followed by (b) the dinucleotide GT or CT, 
35 followed by (c) AACAG. Therefore, in a preferred aspect, 

foreign DNA encoding an Antigen can readily be expressed as a 
nonfusion protein from intergenic regions, simply by ensuring 
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that this three-part intergenic region is reconstituted — 
i.e., that this intergenic region appears 5' and 3' to the 
foreign DNA and also 5' and 3' to the adjacent genes. For 
example, in a preferred embodiment, DNA consisting of (a). 

,5 this three-part intergenic region, fused to (b) foreign DNA 
coding for a desired Antigen (preferably including the 
Antigen gene's native start and stop codons for initiation), 
is inserted into a portion of the vesiculovirus (-) DNA that 
is transcribed to form the 3' noncoding region of any 

10 vesiculovirus mRNA. In a particularly preferred aspect, the 
foreign DNA is inserted in the noncoding region between G and 
L. 

In an alternative embodiment, the foreign DNA can 
be inserted into the G gene, so as to encode a fusion protein 
15 with G, for resultant surface display of the Antigen on the 
vesiculovirus particle. Selection should be undertaken to 
ensure that the foreign DNA insertion does not disrupt G 
protein function. 

In a preferred embodiment, an Antigen expressed by 
20 a recombinant vesiculovirus is all or a portion of an 

envelope glycoprotein of a virus other than a vesiculovirus. 
Such an Antigen can replace the endogenous vesiculovirus G 
protein in the vesiculovirus, or can be expressed as a fusion 
with the endogenous G protein, or can be expressed in 

25 addition to the endogenous G protein either as a fusion or 
nonfusion protein. In a specific embodiment, such an Antigen 
forms a part of the vesiculovirus envelope and thus is 
surface-displayed in the vesiculovirus particle. By way of 
example, gpl60 or a fragment thereof of Human 

30 Immunodeficiency Virus can be the Antigen, which is cleaved 
to produce gpi20 and g P 41 (see Owens and Rose, 1993, J. 
Virol. 67(1): 360-365). In a specific embodiment, the G gene 
of VSV in the VSV (-) DNA of plasmid pVSVFL(+) can be easily 
excised and replaced, by cleavage at the Nhel and Mlul sites 

35 flanking the G gene and insertion of the desired sequence. 
In another specific embodiment, the Antigen is a foreign 
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envelope glycoprotein or portion thereof that is expressed as 
a fusion protein comprising the cytoplasmic domain (and, 
optionally, also the transmembrane region) of the native 
vesiculovirus G protein (see Owens and Rose, 1993, J. Virol. 
5 67(l):360-365). Such a fusion protein can replace or be 
expressed in addition to the endogenous vesiculovirus G 
protein. As shown by way of example in Section 6 below, the 
entire native G coding sequence can be replaced by a coding 
sequence of a different G to produce recombinant replicable 

10 vesiculoviruses that express a non-native glycoprotein. 
While recombinant vesiculoviruses that express and display 
epitope(s) of envelope glycoproteins of other viruses can be 
used as live vaccines, such vesiculoviruses also are 
particularly useful as killed vaccines, as well as in the 

15 production of subunit vaccines containing the vesiculovirus- 
produced protein comprising such epitope (s). 

In a specific embodiment, a recombinant 
vesiculovirus of the invention expresses in a host to which 
il i* odittii^stered one or more Anti^er.s. : lv one embodiment, 

20 a multiplicity of Antigens are expressed, each displaying 
different antigenicity or immunogenic ity. 

5.2. DMA SEQU ENCES ENCODING ANTTGFNS 

The invention provides recombinant vesiculoviruses 

25 capable of replication that have a foreign RNA sequence 

inserted into or replacing a site of the genome nonessential 
for replication, wherein the foreign RNA sequence (which is 
in the negative sense) directs the production of an Antigen 
capable of being expressed in a host infected by the 

30 recombinant virus. This recombinant genome is originally 
produced by insertion of foreign DNA encoding the Antigen 
into the vesiculovirus (-) DNA. Any DNA sequence which 
encodes an immunogenic (capable of provoking an immune 
response) Antigen, which produces prophylactic or therapeutic 

35 immunity against a disease or disorder, when expressed as a 
fusion or, preferably, nonfusion protein in a recombinant 
vesiculovirus of the invention, alone or in combination with 
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other Antigens expressed by the same or a different 
vesiculovirus recombinant, can be isolated for use in the 
vaccine formulations of the present invention. 

In a preferred embodiment, expression of an Antigen 
5 by a recombinant vesiculovirus induces an immune response 
against a pathogenic microorganism. For example, an Antigen 
may display the immunogenicity or antigenicity of an antigen 
found on bacteria, parasites, viruses, or fungi which are 
causative agents of diseases or disorders. in a preferred 
10 embodiment, Antigens displaying the antigenicity or 

immunogenicity of antigens of animal viruses of veterinary 
importance (for example, which cause diseases or disorders in 
non-human animals such as domestic or farm animals, e.g., 
cows, chickens, horses, dogs, cats, etc.) are used. In 
15 another embodiment, Antigens displaying the antigenicity or 
immunogenicity of an antigen of a human pathogen are used. 

To determine immunogenicity or antigenicity by 
detecting binding to antibody, various immunoassays known in 
the art can be used, including but not limited to competitive 
20 and non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel 
diffusion precipitin reactions, immunodiffusion assays, in 
situ immunoassays (using colloidal gold, enzyme or 
25 radioisotope labels, for example), western blots, 

immunoprecipitation reactions, agglutination assays (e.g., 
gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, 
protein A assays, and immunoelectrophoresis assays, etc. In 
30 one embodiment, antibody binding is detected by detecting a 
label on the primary antibody, in another embodiment, the 
primary antibody is detected by detecting binding of a 
secondary antibody or reagent to the primary antibody, m a 
further embodiment, the secondary antibody is labelled. Many 
35 means are known in the art for detecting binding in an 

immunoassay and are envisioned for use. In one embodiment 
for detecting immunogenicity, T cell-mediated responses can 
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be assayed by standard methods, e.g., in vitro cytoxicity 
assays or in vivo delayed-type hypersensitivity assays. 

Parasites and bacteria expressing epitopes 
(antigenic determinants) that can be expressed by recombinant 
5 vesiculoviruses (wherein the foreign RNA directs the 
production of an antigen of the parasite or bacteria or a 
derivative thereof containing an epitope thereof) include but 
are not limited to those listed in Table II. 



10 



15 



20 



25 



TABLE II 

PARASITES AND BACTERIA EXPRESSING EPITOPES 
THAT CAN BE EXPRESS B Y RECOMRTM& NT VESICUI£VIRUSES 

PARASITES: 

Plasmodium spp. 
Eimeria spp. 

BACTERIA : 

JT ibrio cholerae 
Streptococcus pneumoniae 
Neisseria mennigitidis 
Neisseria gonorrhoeae 
Corynebacteria diphtheriae 
Clostridium tetani 
Bordetella pertussis 
Haemophilus spp. (e.g., influenzae) 
. Chlamydia spp. 

Enterotoxigenic Escherichia coli 



30 



35 



In another embodiment, the Antigen comprises an 
epitope of an antigen of a nematode, to protect against 
disorders caused by such worms. 

In another specific embodiment, any DNA sequence 
which encodes a Plasmodium epitope, which when expressed by a 
recombinant vesiculovirus, is immunogenic in a vertebrate 
host, can be isolated for insertion into vesiculovirus (-) 
DNA according to the present invention. The species of 
Plasmodium which can serve as DNA sources include but are not 
limited to the human malaria parasites P. falciparum, P. 
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malariae, P. ovale, P. vivax , and the animal malaria 
parasites P. berghei , P. yoelii , p. Jcno^lesi, and P. 
cy^ojnolgi. In a particular embodiment, the epitope to be 
expressed is an epitope of the circumsporozoite (CS) protein 
5 of a species of Plasmodium (Miller et al. , 1986, Science 
234:1349). 

In yet another embodiment, the Antigen comprises a 
peptide of the 0 subunit of Cholera toxin (Jacob et al., 
1983, Proc. Natl. Acad. Sci. USA 80:7611). 

10 Viruses expressing epitopes (antigenic 

determinants) that can be expressed by recombinant 
vesiculoviruses (wherein the foreign RNA directs the 
production of an antigen of the virus or a derivative thereof 
comprising an epitope thereof) include but are not limited to 

15 those listed in Table III, which lists such viruses by family 
for purposes of convenience and not limitation (see 1990, 
Fields Virology, 2d ed. , Fields and Knipe (eds.), Raven ' 
Press, NY) . 



20 



TABLE III 



VIRUSES EXPRESSING EPITOPES THAT CAN 
BE EXPRESSED BY RttP QMBINANT V EST CUT .OVTPTT.QT?g 

I. Picornaviridae 
25 Enteroviruses 

Poliovirus 
Coxsackievirus 
Echovirus 
Rhinoviruses 
Hepatitis A Virus 

30 II. Caliciviridae 

Norwalk group of viruses 

III. Togaviridae and Flaviviridae 

Toga viruses (e.g., Dengue virus) 
Alphaviruses 

Flaviviruses (e.g., Hepatitis c virus) 
Rubella virus 

35 

IV. Coronaviridae 

Corona viruses 
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V. Rhabdoviridae 

Rabies virus 

VI. Filoviridae 

Marburg viruses 
Ebola viruses 

5 

VII. Paramyxoviridae 

Parainfluenza virus 
Mumps virus 
Measles virus 

Respiratory syncytial virus 

10 VIII « Orthomyxoviridae 

Orthomyxoviruses (e.g., Influenza virus) 

IX. Bunyaviridae 

Bunyaviruses 

X. Arenaviridae 

Arenaviruses 

15 

XI. Reoviridae 

Reoviruses 

Rotaviruses 

Orbiviruses 

XII. Act<: u village 

2 0 Human T Cell Leukemia Virus type I 

Human T Cell Leukemia Virus type II 
Human Immunodeficiency Viruses re a 

, type I and type II) 
Simian Immunodeficiency Virus 
Lent i viruses 

XIII. Papoviridae 

25 Polyomaviruses 

Papillomaviruses 
Adenoviruses 

XIV. Parvoviridae 

Parvoviruses 

30 XV. Herpesviridae 

Herpes Simplex Viruses 
Epstein-Barr virus 
Cytomegalovirus 
Varicella-Zoster virus 
Human Herpesvirus-6 

Cercopithecine Herpes virus l (b virus) 

35 XVI. Poxviridae 

Poxviruses 
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XVIII. Hepadnaviridae 

Hepatitis B virus 



.... In specific embodiments, the Antigen encoded by the 
foreign sequences that is expressed upon infection of a host 
by the recombinant vesiculovirus, displays the antigenicity 
or immunogenicity of an influenza virus hemagglutinin 

^ (Genbank accession no. J02132; Air, 1981, Proc. Natl. Acad. 
Sci. USA 78:7639-7643; Newton et al. , 1983, Virology 
128:495-501); human respiratory syncytial virus G 
glycoprotein (Genbank accession no. Z33429; Garcia et al., 
1994, J. Virol.; Collins et al., 1984, Proc. Natl. Acad.' 

^ Sci. USA 81:7683); core protein, matrix protein or other 
protein of Dengue virus (Genbank accession no. M19197; Hahn 
et al., 1988, Virology 162:167-180), measles virus 
hemagglutinin (Genbank accession no. M81899; Rota et al., 
1992, Virology 188:135-142); and herpes simplex virus type 2 

^ glycoprotein gB (Genbank accession no. M14923; Bzik et al., 
1986, Virology 155:322-333). 

In another embodiment, one or more epitopes of the 
fusion protein of respiratory synctyial virus (RSV) can be 
expressed as an Antigen. 

other Antigens that can be expressed by a 
recombinant vesiculovirus include but are not limited to 
those displaying the antigenicity or immunogenicity of the 
following antigens: Poliovirus I VP1 (Emini et al., 1983, 
Nature 304:699); envelope glycoproteins of HIV I (Putney et 
al., 1986, Science 234:1392-1395); Hepatitis B surface 
antigen (Itoh et al., 1986, Nature 308:19; Neurath et al. , 
1986, Vaccine 4:34); Diptheria toxin (Audibert et al., 1981, 
Nature 289:543); streptococcus 24M epitope (Beachey, 1985, 
Adv. Exp. Med. Biol. 185:193); and gonococcal pilin (Rothbard 
^ and Schoolnik, 1985, Adv. Exp. Med. Biol. 185:247). 

In other embodiments, the Antigen expressed by the 
recombinant vesiculovirus displays the antigenicity or 



30 
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immunogenicity of pseudorabies virus g50 (gpD) , pseudorabies 
virus II (gpB) , pseudorabies virus gill (gpc) , pseudorabies 
virus glycoprotein H, pseudorabies virus glycoprotein E, 
transmissible gastroenteritis glycoprotein 195, transmissible 
5 gastroenteritis matrix protein, swine rotavirus glycoprotein 
38, swine parvovirus capsid protein, Serpulina 
hydodysenteriae protective antigen, Bovine Viral Diarrhea 
glycoprotein 55, Newcastle Disease Virus hemagglutinin- 
neuraminidase, swine flu hemagglutinin, or swine flu 
10 neuraminidase. 

In various embodiments, the Antigen expressed by 
the recombinant vesiculovirus displays the antigenicity or 
immunogenicity of an antigen derived from Serpulina 
hyodysenteriae , Foot and Mouth Disease Virus, Hog Colera 
15 Virus, swine influenza virus, African Swine Fever Virus, 
Mycoplasma hyopneumoniae , infectious bovine rhinotracheitis 
virus (e.g., infectious bovine rhinotracheitis virus 
glycoprotein E or glycoprotein G) , or infectious 
laryngotracheitis virus (e.g., infection Vciryngotracheitis 
2 0 virus glycoprotein G or glycoprotein I). 

In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of a glycoprotein of La Crosse 
Virus (Gonzales-Scarano et al., 1982, Virology 120:42), 
Neonatal Calf Diarrhea Virus (Matsuno and Inouye, 1983, 
25 Infection and Immunity 39:155), Venezuelan Equine 

Encephalomyelitis Virus (Mathews and Roehrig, 1982, J. 
Immunol. 129:2763), Punta Toro Virus (Dalrymple et al., 1981, 
in Replication of Negative Strand Viruses, Bishop and Compans 
(eds.), Elsevier, NY, p. 167), Murine Leukemia Virus (Steeves 
30 et al., 1974, J. Virol. 14:187), or Mouse Mammary Tumor Virus 
(Massey and Schochetman, 1981, Virology 115:20). 

In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of an antigen of a human 
pathogen, including but not limited to human herpesvirus, 
35 herpes simplex virus-l, herpes simplex- virus-2 , human 

cytomegalovirus, Epstein-Barr virus, Varicella-Zoster virus, 
human herpesvirus-6 , human herpesvirus-7 , human influenza 
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virus, human immunodeficiency virus, rabies virus, measles 
virus, hepatitis B virus, hepatitis C virus, Plasmodium 
falciparum, and Bordetella pertussis. 

In a specific embodiment of the invention, a 
5 recombinant vesiculovirus expresses hepatitis B virus core 
protein and/or hepatitis B virus surface antigen or a 
fragment or derivative thereof (see, e.g., U.K. Patent 
Publication No. GB 2034323A published June 4, 1980; Ganem and 
Varmus, 1987, Ann. Rev. Biochem. 56:651-693; Tiollais et al 
10 1985, Nature 317:489-495). The HBV genome (subtype adw) is" 
contained in plasmid pAM6 (Moriarty et al., 1981, Proc. Natl 
Acad. Sci. USA 78:2606-2610, available from the American Type 
Culture Collection (ATCC) , Accession No. 45020), a 
pBR322-based vector that is replicable in E. coli. 
15 In anothe r embodiment, the Antigen expressed by the 

recombinant vesiculovirus displays the antigenicity or 
immunogenicity of an antigen of equine influenza virus or 
equine herpesvirus. Examples of such antigens are equine 
influent virusrtype A/Alaska 91 neuraminidase, equine 
20 influenza virus type A/Miami 63 neuraminidase, equine 
influenza virus type A/Kentucky 81 neuraminidase equine 
herpesvirus type l glycoprotein B, and equine herpesvirus 
type l glycoprotein D. 

In another embodiment, the Antigen displays the 
25 antigenicity or immunogenicity of an antigen of bovine 
respiratory syncytial virus or bovine parainfluenza virus. 
For example, such antigens include but are not limited to 
bovine respiratory syncytial virus attachment protein 
(BRSV G) , bovine respiratory syncytial virus fusion protein 
30 (BRSV F) , bovine respiratory syncytial virus nucleocapsid 
protein (BRSV N) , bovine parainfluenza virus type 3 fusion 
protein, and the bovine parainfluenza virus type 3 
hemagglutinin neuraminidase. 

In another embodiment, the Antigen displays the 
35 antigenicity or immunogenicity of bovine. viral diarrhea virus 
glycoprotein 48 or glycoprotein 53. 
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In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of an antigen of infectious 
bursal disease virus. Examples of such antigens are 
infectious bursal disease virus polyprotein and VP 2 . 
5 Potentially useful antigens or derivatives thereof 

for use as Antigens expressed by recombinant vesiculoviruses 
can be identified by various criteria, such as the antigen's 
involvement in neutralization of a pathogen's infectivity 
(Norrby, 1985, Summary, in Vaccines85, Lerner et al. (eds.), 

10 Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
pp. 388-389), type or group specificity, recognition by 
patients 1 antisera or immune cells, and/or the demonstration 
of protective effects of antisera or immune cells specific 
for the antigen. In addition, the antigen's encoded epitope 

15 should preferably display a small or no degree of antigenic 
variation in time or amongst different isolates of the same 
pathogen. 

In a preferred embodiment, the foreign DNA inserted 
into the vesiculovirus (-} DNA encodes Ln-2.rsniu;;opoucr.c 

20 dominant epitope of a pathogen. Foreign DNA encoding 

epitopes which are reactive with antibody although incapable 
of eliciting immune responses, still have potential uses in 
immunoassays (see Section 5.8, Infra). 

In another embodiment, foreign RNA of the 

25 recombinant vesiculovirus directs the production of an 
Antigen comprising an epitope, which when the recombinant 
vesiculovirus is introduced into a desired host, induces an 
immune response that protects against a condition or disorder 
caused by an entity containing the epitope. For example, the 

30 Antigen can be a tumor specific antigen or tumor-associated 
antigen, for induction of a protective immune response 
against a tumor (e.g., a malignant tumor). Such tumor- 
specific or tumor-associated antigens include but are not 
limited to KS 1/4 pan-carcinoma antigen (Perez and Walker, 

35 1990, J. Immunol. 142:3662-3667; Bumal , 1988, Hybridoma 

7(4) :407-415) ; ovarian carcinoma antigen (CA125) (Yu et al., 
1991, Cancer Res. 51 (2) : 468-475) ; prostatic acid phosphate 
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(Tailor et al., 1990, Nucl. Acids Res. 18 (16) : 4928) ; prostate 
specific antigen (Henttu and Vihko, 1989, Biochem. Biophys. 
Res. comm. 160 (2) : 903-910; Israeli et al. , 1993, Cancer Res. 
53:227-230; melanoma-associated antigen p97 (Estin et al., 
5 1989, J. Natl, cancer Instit. 81 (6) : 445-446) ; melanoma 
antigen gp75 (Vi jayasardahl et al. , 1990, J. Exp. Med. 
171(4) :1375-1380); high molecular weight melanoma antigen 
(Natali et al., 1987, Cancer 59:55-63); and prostate specific 
membrane antigen. 

In another embodiment of the invention, the Antigen 
expressed by the recombinant vesiculovirus comprises large 
regions of proteins which contain several B cell epitopes 
(i.e., epitopes capable of enticing a humoral immune 
response) and T cell epitopes (i.e., epitopes capable of 
15 inducing a cell-mediated immune response) . 

Peptides or proteins which are known to contain 
antigenic determinants can be used as the Antigen. If 
specific desired antigens are unknown, identification and 
character iza Lien of immunoreactive sequences can be carried 
20 out. one way in which to accomplish this is through the use 
of monoclonal antibodies generated to the surface or other 
molecules of a pathogen or tumor, as the case may be. The 
peptide sequences capable of being recognized by the 
antibodies are defined epitopes. Alternatively, small 
25 synthetic peptides conjugated to carrier molecules can be 
■ tested for generation of monoclonal antibodies that bind to 
the sites corresponding to the peptide, on the intact 
molecule (see, e.g., Wilson et al., 1984, Cell 37:767). 

In a specific embodiment, appropriate Antigens, 
30 including fragments or derivatives of known antigens, can be 
identified by virtue of their hydrophilicity , by carrying out 
a hydrophilicity analysis (Hopp and Woods, 1981, Proc. Natl. 
Acad. Sci. USA 78:3824) to generate a hydrophilicity profile. 
A hydrophilicity profile can be used to identify the 
35 hydrophobic and hydrophilic regions of a protein and the 

corresponding regions of the gene sequence which encode such 
proteins. Hydrophilic regions are predicted to be 
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immunogenic/antigenic. other methods known in the art which 
may be employed for the identification and characterization 
of antigenic determinants are also within the scope of the 
invention. 

5 The foreign DNA encoding the Antigen, that is 

inserted into a non-essential site of the vesiculovirus (-) 
DNA, optionally can further comprise a foreign DNA sequence 
encoding a cytokine capable of being expressed and 
stimulating an immune response in a host infected by the 

10 recombinant vesiculovirus. For example, such cytokines 

include but are not limited to interleukin-2, inter leukin-6 , 
interleukin-12, interferons, granulocyte-macrophage colony 
stimulating factors, and inter leukin receptors. 

The foreign DNA optionally can further comprise a 

15 sequence encoding and capable of expressing a detectable 
marker (e.g., 0 galactosidase) . 

5.3. CONSTRUCTION OF VESICULOVIRUS (-) DNA 
CONTAINING FOREIGN DNA 

2 0 For initial production of a recombinant 

vesiculovirus, the foreign DNA comprising a sequence encoding 
the desired antigen is inserted into and/or replaces a region 
of the vesiculovirus (-) DNA nonessential for replication. 
Many strategies known in the art can be used in the 
25 construction of the vesiculovirus (-) DNA containing the 
foreign DNA. For example, the relevant sequences of the 
foreign DNA and of the vesiculovirus (-) dna can, by 
techniques known in the art, be cleaved at appropriate sites 
with restriction endonuclease (s) , isolated, and ligated in 
30 vit ro. If cohesive termini are generated by restriction 

endonuclease digestion, no further modification of DNA before 
ligation may be needed. If, however, cohesive termini of the 
DNA are not available for generation by restriction 
endonuclease digestion, or different sites other than those 
35 available are preferred, any of numerous techniques known in 
the art may be used to accomplish ligation of the 
heterologous DNA at the desired sites, in a preferred 
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embodiment, a desired restriction enzyme site is readily 
introduced into the desired DNA by amplification of the DNA 
by use of PCR with primers containing the restriction enzyme 
site. B y way of another example, cleavage with a restriction 
5 enzyme can be followed by modification to create blunt ends 
. by digesting back or filling in single-stranded DNA termini 
before ligation. Alternatively, the cleaved ends of the 
vesiculovirus (-) DNA or foreign DNA can be "chewed back- 
using a nuclease such as nuclease Bal 31, exonuclease III 
10 lambda exonuclease, mung bean nuclease, or T4 DNA polymerase 
exonuclease activity, to name but a few, in order to remove 
portions of the sequence. 

To facilitate insertion of the foreign DNA, an 
oligonucleotide sequence (a linker) which encodes one or more 
15 restriction sites can be inserted in a region of the 

vesiculovirus (-) DNA (see, e.g., the polylinker in pVSVSSl 
Fig. 2) by ligation to DNA termini. A linker may also be 
used to generate suitable restriction sites in the foreign 
DNA sequence. 

20 Additionally, vesiculovirus (-) DNA or foreign DNA 

sequences can be mutated in vitro or in vivo in order to form 
new restriction endonuclease sites or destroy preexisting 
ones, to facilitate in vitro ligation procedures. Any 
technique for mutagenesis known in the art can be used 
25 including but not limited to, in vitro site-directed 
mutagenesis (Hutchinson et al., 1978, J. Biol. Chem. 
253:6551), chemical mutagenesis, etc. 

Sequences of the vesiculovirus (-) DNA that have 
been undesirably modified by such in vitro manipulations can 
30 be -restored," if desired, by introduction of appropriate 
sequences at the desired sites. 

The particular strategy for inserting the foreign 
DNA will depend on the specific vesiculovirus (-) DNA site to 
be replaced or inserted into, as well as the foreign DNA to 
35 be inserted. 
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The sequences encoding the immunogenic peptides or 
proteins are preferably present in single copies, but can 
also be present in multiple copies within the virus genome. 

Formation of the desired vesiculovirus (-) DNA 
5 containing the foreign DNA can be confirmed by standard 
methods such as DNA sequence analysis, hybridization 
analysis, and/or restriction mapping, using methods well 
known in the art. 

Foreign DNA encoding a desired antigen can be 
10 obtained from any of numerous sources such as cloned DNA 

genomic DNA, or cDNA made from RNA of the desired pathogen or 
tumor, as the case may be, or chemically synthesized DNA, and 
manipulated by recombinant DNA methodology well known in the 
art (see Sambrook et al., 1991, Molecular Cloning A 
15 Laboratory Manual, 2d ed. , Cold Spring Harbor Laboratory 
Press, New York), in a preferred embodiment, polymerase 
chain reaction (PCR) is used to amplify the desired fragment 
of foreign DNA from among a crude preparation of DNA or a 
small sample of the DNA, by standard methods. «ppio^i 6l « 
20 primers for use in PCR can be readily deduced based on 
published sequences. 

In order to generate appropriate DNA fragments, the 
DNA (e.g., from the pathogen or tumor of interest) may be 
cleaved at specific sites using various restriction enzymes. 
25 Alternatively, one may use DNasel in the presence of 
manganese, or mung bean nuclease (McCutchan et al., 1984 
Science 225:626), to fragment the DNA, or the DNA can be' 
Physically sheared, as for example, by sonication. The 
linear DNA fragments can then be separated according to size 
30 by standard techniques, including, but not limited to 
agarose and polyacrylamide gel electrophoresis and column 
chromatography . 

PCR amplification of DNA fragments containing the 
desired epitope (s) is most preferably carried out, in which 
35 the PCR primers contain and thus introduce into the amplified 
DNA a desired restriction enzyme recognition site. 
Alternatively, any restriction enzyme or combination of 
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restriction enzymes may be used to generate DNA fragment (s) 
containing the desired epitope (s) , provided the enzymes do 
not destroy the immunopotency of the encoded product. 
Consequently, many restriction enzyme combinations may be 
5 used to generate DNA fragments which, when inserted into the - 
vesiculovirus (-) DNA, are capable of producing recombinant 
vesiculoviruses that direct the production of the peptide 
containing the epitope (s). 

Once the* DNA fragments are generated, 
10 identification of the specific fragment containing the 
desired sequence may be accomplished in a number of ways. 
For example, if a small amount of the desired DNA sequence or 
a homologous sequence is previously available, it can be used 
as a labeled probe (e.g., nick translated) to detect the DNA 
15 fragment containing the desired sequence, by nucleic acid 

hybridization. Alternatively, if the sequence of the derived 
gene or gene fragment is known, isolated fragments or 
portions thereof can be sequenced by methods known in the 
art, and identified by a -comparison of the derived sequence 
2 0 to that of the known DNA or protein sequence. Alternatively, 
the desired fragment can be identified by techniques 
including but not limited to mRNA selection, making cDNA to 
the identified mRNA, chemically synthesizing the gene 
sequence (provided the sequence is known) , or selection on 
25 the basis of expression of the encoded protein (e.g., by 
antibody binding) after "shotgun cloning" of various DNA 
fragments into an expression system. 

The sequences encoding peptides to be expressed in 
recombinant vesiculoviruses according to the present 
30 invention, whether produced by recombinant DNA methods, 

chemical synthesis, or purification techniques, include but 
are not limited to sequences encoding all or part (fragments) 
of the amino acid sequences of pathogen-specific and 
tumor-specific antigens, as well as other derivatives and 
35 analogs thereof displaying the antigenicity or immunogenicity 
thereof. Derivatives or analogs of antigens can be tested 
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for the desired activity by procedures known in the art, 
including but not limited to standard immunoassays. 

In particular, antigen derivatives can be made by 
altering the encoding antigen nucleotide sequences by 
5 substitutions, additions or deletions that do not destroy the 
antigenicity or immunogenicity of the antigen. For example, 
due to the degeneracy of nucleotide coding sequences, other 
DNA sequences which encode substantially the same amino acid 
sequence as a native antigen gene or portion thereof may be 
10 used in the practice of the present invention. Other 
examples may include but are not limited to nucleotide 
sequences comprising all or portions of genes or cDNAs which 
are altered by the substitution of different codons that 
encode a functionally equivalent amino acid residue within 
15 the sequence, thus producing a silent change. For example, 
one or more amino acid residues within the sequence can be 
substituted by another amino acid of a similar polarity which 
acts as a functional equivalent, resulting in a silent 
alteration. Substitutes for an amino acid within un* 
2 0 sequence may be selected from other members of the class to 
which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine. The polar neutral amino acids include glycine, 
25 serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids 
include arginine, lysine, and histidine. The negatively 
charged (acidic) amino acids include aspartic and glutamic 
acid. 

30 The antigen derivatives and analogs can be produced 

by various methods known in the art. For example, a cloned 
gene sequence can be modified by any of numerous strategies 
known in the art (Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, 2d ed. , Cold Spring Harbor Laboratory, 

35 Cold Spring Harbor, New York). The sequence can be cleaved 
at appropriate sites with restriction endonuclease (s) , 
followed by further enzymatic modification if desired, 
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isolated, and ligated in vitro. In the production of the 
gene encoding a derivative or analog of an antigen, care 
should be taken to ensure that the modified gene remains 
within the same translational reading frame as the antigen, 
5 uninterrupted by translational stop signals, in the gene 
region where the desired epitope (s) are encoded. 

Additionally, the antigen-encoding nucleic acid 
sequence can be mutated in vitro or in vivo, to create and/ or 
destroy translation, initiation, and/or termination 

10 sequences, or to create variations in coding regions and/or 
form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro 
modification. Any technique for mutagenesis known in the art 
can be used, including but not limited to, in vitro site- 

15 directed mutagenesis (Hutchinson, C. , et al., 1978, J. Biol. 
Chem 253:6551), use of TAB® linkers (Pharmacia), etc. 

In another specific embodiment, the encoded antigen 
derivative is a chimeric, or fusion, protein comprising a 
first protein or fragment thereof fused to a second, 

20 different amino acid sequence. Such a chimeric protein is 
encoded by a chimeric nucleic acid in which the two coding 
sequences are joined inframe. Such a chimeric product can be 
made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other by 

25 methods known in the art, in the proper coding frame. In a 
specific embodiment, a fusion protein is produced in which 
the first protein sequence contains an epitope of an antigen, 
and the second protein sequence contains an epitope of a 
different antigen. 

30 Derivatives and fragments of known antigens can be 

readily tested by standard immunoassay techniques to 
ascertain if they display the desired immunogenicity or 
antigenicity, rendering a DNA sequence encoding such a 
fragment or derivative suitable for insertion into the 

35 vesiculovirus (-) DNA. 

A DNA sequence encoding an epitope that is a 
hapten, i.e., a molecule that is antigenic in that it can 
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react selectively with cognate antibodies, but not 
immunogenic in that it cannot elicit an immune • response when 
administered without adjuvants or carrier proteins, can also 
be isolated for use, since it is envisioned that, in 
5 particular embodiments, presentation by the vesiculoviruses 
of the invention can confer immunogenicity to the hapten 
expressed by the virus. 

Once identified and isolated, the foreign DNA 
containing the sequence (s) of interest is then inserted into 
10 the vesiculovirus (-) DNA, for production of a recombinant 
vesiculovirus. 



5 -4. PRODUCTI ON OF RECOMBINANT VESICULOVIRUSES 

The recombinant vesiculoviruses of the invention 
15 are produced by providing in an appropriate host cell: 
vesiculovirus (-) DNA, in which regions nonessential for 
replication have been inserted into or replaced by foreign 
DNA comprising a sequence encoding an Antigen, and 
recombinant sources of vesiculovirus N protein, P protein, 
20 and L protein. The production is preferably in vitro, in 
cell culture. 

The host cell used for recombinant vesiculovirus 
production can be any cell in which vesiculoviruses grow, 
e.g., mammalian cells and some insect (e.g., Drosophila) 

25 cells. Primary cells, or more preferably, cell lines can be 
used. A vast number of cell lines commonly known in the art 
are available for use. By way of example, such cell lines 
include but are not limited to BHK (baby hamster kidney) 
cells, CHO (Chinese hamster ovary) cells, HeLA (human) cells, 

30 mouse L cells, Vero (monkey) cells, ESK-4 , PK-15, EMSK cells, 
MDCK (Madin-Darby canine kidney) cells, MDBK (Madin-Darby 
bovine kidney) cells, 293 (human) cells, and Hep-2 cells. 

The sources of N, P, and L proteins can be the same 
or different recombinant nucleic acid(s) , encoding and 

35 capable of expressing the N, P and L proteins in the host 
cell in which it is desired to produce recombinant 
vesiculovirus. 
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The nucleic acids encoding the N, P and L proteins 
are obtained by any means available in the art. The N, P and 
L nucleic acid sequences have been disclosed and can be used. 
For example, see Genbank accession no. J02428; Rose and 
5 Schubert, 1987, in The Viruses: The Rhabdovlruses , Plenum 
Press, NY, pp. 129-166. The sequences encoding the N, P and 
L genes can also be obtained from plasmid pVSVFL(+) , 
deposited with the ATCC and assigned accession no. 97134, 
e.g., by PCR amplification of the desired gene (PCR; U.S. 
10 Patent Nos. 4,683,202, 4,683,195 and 4,889,818; Gyllenstein 
et al., 1988, Proc. Natl. Acad. Sci . USA 85:7652-7656; Ochman 
et al., 1988, Genetics 120:621-623; Loh et al., 1989, Science 
243:217-220). If a nucleic acid clone of any of the N, P or 
L genes is not already available, the clone can be obtained 
15 by use of standard recombinant DNA methodology. For example, 
the DNA may be obtained by standard procedures known in the 
art by purification of rna from vesiculoviruses followed by 
reverse transcription and polymerase chain reaction (Mullis 
and Faloona, 1987, Methods in Enz^ology 155:335-350) . 
2 0 Alternatives to isolating an N, P or L gene include, but are 
not limited to, chemically synthesizing the gene sequence 
itself. other methods are possible and within the scope of 
the invention. 

If desired, the identified and isolated gene can 
25 then optimally be inserted into an appropriate cloning vector 
prior to transfer to an expression vector. 

Nucleic acids that encode derivatives (including 
fragments) and analogs of native N , P and L genes, as well as 
derivatives and analogs of the vesiculovirus (-) DNA can also 
30 be used in the present invention, as long as such derivatives 
and analogs retain function, as exemplified by the ability 
when used according to the invention to produce a replicable 
vesiculovirus containing a genomic RNA containing foreign 
RNA. In particular, derivatives can be made by altering 
35 sequences by substitutions, additions, or deletions that 

provide for functionally active molecules. Furthermore, due 
to the inherent degeneracy of nucleotide coding sequences, 
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other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be used in 
the practice of the methods of the invention. Amino acid 
substitutions may be made on the basis of similarity in 
.5 polarity, charge, solubility, hydrophobicity , hydrophilicity 
and/ or the amphipathic nature of the residues involved. 

. The desired N/P/L-encoding nucleic acid is then 
preferably inserted into an appropriate expression vector, 
i.e., a vector which contains the necessary elements for the 

10 transcription and translation of the inserted protein-coding 
sequence in the host in which it is desired to produce 
recombinant vesiculovirus, to create a vector that functions 
to direct the synthesis of the N/P/L protein that will 
subsequently assemble with the vesiculovirus genomic RNA 

15 containing the foreign sequence (produced in the host cell 
from antigenomic vesiculovirus ( + ) RNA produced by 
transcription of the vesiculovirus (-) DNA) . A variety of 
vector systems may be utilized to express the N, P and 
L-~Mrr.i, sequences, as well as to transcribe the 

20 vesiculovirus (-) DNA containing the foreign DNA, as long as 
the vector is functional in the host and compatible with any 
other vector present. Such vectors include but are not 
limited to bacteriophages, plasmids, or cosmids. In a 
preferred aspect, a plasmid expression vector is used. The 

25 expression elements of vectors vary in their strengths and 
specificities. Any one of a number of suitable transcription 
and translation elements may be used, as long as they are 
functional in the host. 

Standard recombinant DNA methods may be used to 

30 construct expression vectors containing DNA encoding the N , 
P, and L proteins, and the vesiculovirus (-) DNA containing 
the foreign DNA, comprising appropriate transcriptional/ 
translational control signals (see, e.gr., Sambrook et al., 
1989, supra, and methods described hereinabove). 

35 (Translational control signals are not needed for 
' transcription of the vesiculovirus (-) DNA, and thus may be 
omitted from a vector containing the vesiculovirus (-) DNA, 
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although such signals may be present in the vector and 
operably linked to other sequences encoding a protein which 
it is desired to express). Expression may be controlled by 
any promoter/ enhancer element known in the art. Promoters 
5 which may be used to control expression can be constitutive 
or inducible. In a specific embodiment, the promoter is an 
RNA polymerase promoter. 

Transcription termination signals (downstream of 
the gene) , and selectable markers are preferably also 

10 included in a plasmid expression vector. In addition to 
promoter sequences, expression vectors for the N, P, and L 
proteins preferably contain specific initiation signals for 
efficient translation of inserted N/P/L sequences, e.g., a 
ribosome binding site. 

15 Specific initiation signals are required for 

efficient translation of inserted protein coding sequences. 
These signals include the ATG initiation codon and adjacent 
sequences. In cases where the entire N, P, or L gene 
ihcluuxng its own initiation coaor. ^.nd adjacent sequences are 

2 0 inserted into the appropriate vectors, no additional 

translational control signals may be needed. However, in 
cases where only a portion of the gene sequence is inserted, 
exogenous translational control signals, including the ATG 
initiation codon, must be provided. The initiation codon ; 

25 must furthermore be in phase with the reading frame of the 
protein coding sequences to ensure translation of the entire 
insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both 
natural and synthetic. 

30 In a specific embodiment, a recombinant expression 

vector provided by the invention, encoding an N, P, and/or L 
protein or functional derivative thereof, comprises the 
following operatively linked . components: a promoter which 
controls the expression of the N, P, or L protein or 
35 functional derivative thereof, a translation initiation 
signal, a DNA sequence encoding the N, P or L protein or 
functional derivative thereof, and a transcription 
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termination signal. In a preferred aspect, the above 
components are present in 5' to 3' order as listed above. 

In another specific embodiment, the gene encoding 
the N, P, or L protein is inserted downstream of the T7 RNA 
5 polymerase promoter from phage T7 gene 10, situated with an A 
in the -3 position. A T7 RNA polymerase terminator and a 
replicon are also included in the expression vector. In this 
embodiment, T7 RNA polymerase is provided to transcribe the 
N/P/L sequence. The T7 RNA polymerase can be produced from a 

10 chromosomally integrated sequence or episomally, and is most 
preferably provided by intracellular expression from a 
recombinant vaccinia virus encoding the T7 RNA polymerase 
(see infra). Preferably, the N, P, and L proteins are each 
encoded by a DNA sequence operably linked to a promoter in an 

15 expression plasmid, containing the necessary regulatory 

signals for transcription and translation of the N, P, and L 
proteins. Such an expression plasmid preferably includes a 
promoter, the coding sequence, and a transcription 
te^air**.* r>r./polyadenylation signal, and optionally, a 

20 selectable marker (e.g., /3-galactosidase) . The N, P and L 
proteins can be encoded by the same or different plasmids, or 
a combination thereof, and preferably are in different 
plasmids. Less preferably, one or more of the N, P, and L 
proteins can be expressed intrachromosomally . 

25 The cloned sequences comprising the vesiculovirus 

(-) DNA containing the foreign DNA, and the cloned sequences 
comprising sequences encoding the N, P, and L proteins can be 
introduced into the desired host cell by any method known in 
the art, e.g., transf ection, electroporation, infection (when 

30 the sequences are contained in, e.g., a viral vector), 
microinjection, etc. 

^ In a preferred embodiment, DNA comprising 

vesiculovirus (-) DNA containing foreign DNA encoding an 
Antigen, operably linked to an RNA polymerase promoter 

35 (preferably a bacteriophage RNA polymerase promoter) ; DNA 
encoding N, operably linked to the same RNA polymerase 
promoter; DNA encoding P, operably linked to the same 
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polymerase promoter; and DNA encoding L, operably linked to 
the same polymerase promoter; are all introduced (preferably 
by transfection) into the same host cell, in which host cell 
the RNA polymerase has been cytoplasmically provided. The 
5 RNA polymerase is cytoplasmically provided preferably by 
expression from a recombinant virus that replicates in the 
cytoplasm and expresses the RNA polymerase, most preferably a 
vaccinia virus (see the section hereihbelow) , that has been 
introduced (e.g., by infection) into the same host cell. 

10 Cytoplasmic provision of RNA polymerase is preferred, since 
this will result in cytoplasmic transcription and processing, 
of the VSV (-) DNA comprising the foreign DNA and of the N, P 
and L proteins, avoiding splicing machinery in the cell 
nucleus, and thus maximizing proper processing and production 

15 of N, P and L proteins, and resulting assembly of the 

recombinant vesiculovirus. For example, vaccinia virus also 
cytoplasmically provides enzymes for processing (capping and 
polyadenylation) of mRNA, facilitating proper translation. 
In a most preferred aspect, rv RNA polymerase promoters are 

2 0 employed, and a cytoplasmic source of T7 RNA polymerase is 
provided by also introducing into the host cell a recombinant 
vaccinia virus encoding T7 RNA polymerase into the host cell. 
Such vaccinia viruses can be obtained by well known methods 
(see section 5.5, infra). In a preferred aspect, a 

25 recombinant vaccinia virus such as VTF7-3 (Fuerst et al., 
1986, Proc. Natl. Acad. Sci. U.S.A. 83:8122-8126) can be 
used. In a most preferred aspect, the DNA comprising 
vesiculovirus (-) DNA containing foreign DNA is plasmid 
pVSVSSl in which foreign DNA has been inserted into the 

30 polylinker region. 

Alternatively, but less preferably, the RNA 
polymerase (e.g., T7 RNA polymerase) can be provided by use 
of a host cell that expresses T7 RNA polymerase from a 
chromosomal ly integrated sequence (e.g., originally inserted 

35 into the chromosome by homologous recombination) , preferably 
constitutively , or that expresses T7 RNA polymerase 
episomally, from a plasmid. 
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In another, less preferred, embodiment, the VSV (-) 
DNA encoding an Antigen, operably linked to a promoter, can 
be transfected into a host cell that stably recombinantly 
expresses the N, P, and L proteins from chromosomally 
5 integrated sequences. 

The cells are cultured and recombinant 
vesiculovirus is recovered, by standard methods. For 
example, and not by limitation, after approximately 24 hours, 
cells and medium are collected, freeze- thawed , and the . 
10 lysates clarified to yield virus preparations. 

Alternatively, the cells and medium are collected and simply 
cleared of cells and debris by low-speed centrifugation. 

Confirmation that the appropriate foreign sequence 
is present in the genome of the recombinant vesiculovirus and 
15 directs the production of the desired protein (s) in an 
infected cell, is then preferably carried out. standard 
procedures known in the art can be used for this purpose. 
For example, genomic RNA is obtained from the vesiculovirus 
■■■■■■i/jf duo -pncnol extraction from- virus -r*r«r»*rat ions', - -and can be 
20 subjected to reverse transcription (and PCR, if desired), 
followed by sequencing, Southern hybridization using a probe 
specific to the foreign DNA, or restriction enzyme mapping, 
etc. The virus can be used to infect host cells, which can 
then be assayed for expression of the desired protein by 
25 standard immunoassay techniques using an antibody to the 
protein, or by assays based on functional activity of the 
protein. Other techniques are known in the art and can be 
used. 

The invention also provides kits for production of 
30 recombinant vesiculoviruses. In one embodiment, the kit 
comprises in one or more (and most preferably, in separate) 
containers: (a) a first recombinant DNA that can be 
transcribed in a suitable host cell to produce a 
vesiculovirus antigenomic (+) rna in which a portion of the 
35 RNA nonessential for replication of the vesiculovirus has 
been inserted into or replaced by a foreign RNA sequence; (b) 
a second recombinant DNA comprising a sequence encoding a 
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vesiculovirus N protein; (c) a third recombinant DNA 
comprising a sequence encoding a vesiculovirus L protein; and 
(d) a fourth recombinant DNA comprising a sequence encoding a 
vesiculovirus P protein. The second, third and fourth 
5 recombinant DNAs can be part of the same or different DNA 
molecules. In a preferred embodiment, the sequences encoding 
the N, L, and P proteins are each operably linked to a 
promoter that controls expression of the N, L, and P 
proteins, respectively, in the suitable host cell. In 
10 various embodiments, the kit can contain the various nucleic 
acids, e.g., plasmid expression vectors, described 
hereinabove for use in production of recombinant 
vesiculoviruses. 

In another embodiment, a kit of the invention 
15 comprises (a) a first recombinant DNA that can be transcribed 
in a suitable host cell to produce a vesiculovirus 
antigenomic DNA in which a portion of the RNA nonessential 
for replication of the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence; ai.d (b) a host cell 
20 that recombinant ly expresses vesiculovirus N, P and L 
proteins . 

In a preferred embodiment, a kit of the invention 
comprises in separate containers: 

(a) a first plasmid comprising the following 
25 operatively linked components: (i) a bacteriophage RNA 

polymerase promoter, (ii) a DNA comprising a sequence capable 
of being transcribed in a suitable host cell to produce an 
RNA molecule comprising a vesiculovirus antigenomic RNA in 
which a portion of the RNA nonessential for replication of 

3 0 the vesiculovirus has been inserted into or replaced by a 
foreign RNA sequence, and in which the 3' end of the 
antigenomic RNA is immediately adjacent to a ribozyme 
sequence that cleaves at the 3' end of the antigenomic RNA, 
and (iii) a transcriptional termination signal for the 

35 bacteriophage RNA polymerase; and 

(b) a second plasmid comprising the following 
operatively linked components: (i) the bacteriophage RNA 
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polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus N protein, and (ii) a 
transcriptional termination signal for the bacteriophage RNA 
polymerase; and 

5 (c) a third plasmid comprising the following 

operatively linked components: (i) the bacteriophage RNA 
polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus P protein, and (ii) a 
transcriptional termination signal for the bacteriophage RNA 
10 polymerase; and 

(d) a fourth plasmid comprising the following 
operatively linked components: (i) the bacteriophage RNA 
polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus L protein, and (ii) a 
15 transcriptional termination signal for the bacteriophage RNA 
polymerase. 

In another embodiment, a kit of the invention 
further comprises in a separate container a recombinant 
vtiucima viius encoding and capable of c::prcising the 
20 bacteriophage RNA polymerase. 

In a preferred embodiment, the components in the 
containers are in purified form. 

5.4.1. RECOMBINANT VACCINIA VIRUSES 
25 ENCODING AND CAPABLE OF 

EXPRESSING FOREIGN RNA POLYMERASES 

In a preferred aspect of the invention, 
transcription of the vesiculovirus (-) DNA containing the 
foreign DNA encoding an Antigen, and/ or transcription of the 
DNA encoding the N, P, and L proteins in the host cell, is 
controlled by an RNA polymerase promoter (preferably one in 
which the RNA polymerase is not endogenous to the host cell) , 
and the RNA polymerase (that initiates transcription from the 
promoter) is recombinantly provided in the host cell by 
expression from a recombinant vaccinia virus. DNA sequences 
encoding RNA polymerases are well known and available in the 



30 



35 
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art and can be used. For example, phage DNA can be obtained 
and PCR used to amplify the desired polymerase gene. 

Insertion of the desired recombinant DNA sequence 
encoding and capable of expressing the RNA polymerase into a 
5 vaccipia virus for expression by the vaccinia virus is 

preferably accomplished by first inserting the DNA sequence 
into a plasmid vector which is capable of subsequent transfer 
to a vaccinia virus genome by homologous recombination. 
Thus, in a preferred aspect of the invention for constructing 
10 the recombinant vaccinia viruses, the desired DNA sequence 
encoding the polymerase is inserted, using recombinant DNA 
methodology (see Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, 2d ed. , Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York) into an insertion (preferably, 
15 plasmid) vector flanked by (preferably) nonessential vaccinia 
DNA sequences, thus providing for subsequent transfer of its 
chimeric gene(s) into vaccinia virus by homologous 
recombination. The sequences are placed in the vector such 
that they can be expressed under the control of a promoter 
2 0 functional in vaccinia virus. 

Expression of foreign DNA in recombinant vaccinia 
viruses requires the positioning of promoters functional in 
vaccinia so as to direct the expression of the protein-coding 
polymerase DNA sequences. Plasmid insertion vectors have 
25 been constructed to insert chimeric genes into vaccinia virus 
for expression therein. Examples of such vectors are 
described by Mackett (Mackett et al. , 1984. J. Virol. 49:857- 
864). The DNA encoding the polymerase is inserted into a 
suitable restriction endpnuclease cloning site. in addition 
30 to plasmid insertion vectors, insertion vectors based on 
single-stranded M13 bacteriophage DNA (Wilson et al., 1986, 
Gene 49:207-213) can be used. 

The inserted polymerase DNA should preferably not 
contain introns, and insertion should preferably be so as to 
35 place the coding sequences in close proximity to the 

promoter, with no other start codons in between the initiator 
ATG and the 5' end of the transcript. 
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The plasmid insertion vector should contain 
transcriptional and translational regulatory elements that 
are active in vaccinia virus. The plasmid should be 
configured so that the polymerase sequences are under the 
S control of a promoter active in vaccinia virus. Promoters 
which can be used in the insertion vectors include but are 
not limited to the vaccinia virus thymidine kinase (TK) 
promoter, the 7.5K promoter (Cochran et al., 1985, J. Virol. 
54:30-37), the UK promoter (European Patent Publication 
10 0198328), the F promoter (Paoletti et al. , 1984, Proc. Natl. 
Acad. Sci. USA 81:193-197), and various early and late 
vaccinia promoters (see Moss, 1990, Virology, 2d ed., ch. 74, 
Fields et al., eds., Raven Press, Ltd., New York, pp. 2079- 
2111) . 

15 In a specific embodiment, the plasmid insertion 

vector contains (for eventual transfer into vaccinia virus) a 
T7 RNA polymerase coding sequence under the control of a 
promoter active in vaccinia virus, in another specific 
embc/diuicii. L ,- a plasmid insertion vector ccnLwir.c -«,- co- 
2 0 expression system consisting of divergently oriented 

promoters, one directing transcription of the polymerase 
sequences, the other directing transcription of a reporter 
gene or selectable marker, to facilitate detection or 
selection of the eventual recombinant vaccinia virus (see, 
25 e.g., Fuerst et al., 1987, Mol. Cell. Biol. 5:1918-1924).' 

As described supra, the plasmid insertion vector 
contains at least one set of polymerase coding sequences 
operatively linked to a promoter, flanked by sequences 
preferably nonessential for vaccinia viral replication. Such 
30 nonessential sequences include but are not limited to the TK 
gene (Mackett et al., 1984, J. Virol. 49:857-864), the 
vaccinia Hindlll-F DNA fragment (Paoletti et al., 1984, Proc. 
Natl. Acad. sci. USA 81:193-197), the vaccinia growth factor 
gene situated within both terminal repeats (Buller et al., 
35 1988, J. Virol. 62:866-874), the N2 and Ml genes (Tamin et 
al., 1988, Virology 165:141-150), the Ml subunit of the 
ribonucleotide reductase gene in the vaccinia Hindlll-l DNA 
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fragment (Child et al., 1990, Virology 174:625-629), the 
vaccinia hemagglutinin (Shida et al., 1988, J. Virol. 
62:4474-4480), vaccinia 14 kD fusion protein gene (Rodriguez 
et al., 1989, Proc. Natl. Acad. Sci. USA 86:1287-1291), etc. 
5 (see also Buller and Palumbo, 1991, Microbiol. Rev. 55(1) :80- 
122). TK sequences are preferred for use; use of such 
sequences results in the generation of TK - recombinant 
viruses . 

Recombinant vaccinia, viruses are preferably 
10 produced by transfection of the recombinant insertion vectors 
containing the polymerase sequences into cells previously 
infected with vaccinia virus. Alternatively, transfection 
can take place prior to infection with vaccinia virus. 
Homologous recombination takes place within the infected 
15 cells and results in the insertion of the foreign gene into 
the viral genome, in the region corresponding to the 
insertion vector flanking regions. The infected cells can be 
screened using a variety of procedures such as immunological 
techniques, DNA plaque hybridization, or genetic selection 
2 0 for recombinant viruses which subsequently can be isolated. 
These vaccinia recombinants preferably retain their essential 
functions and inf ectivity and can be constructed to 
accommodate up to approximately 35 kilobases of foreign DNA. 

Transfections may be performed by procedures known 
25 in the art, for example, a calcium chloride-mediated 
procedure (Mackett et al., 1985, The construction and 
characterization of vaccinia virus recombinants expressing 
foreign genes, in DNA Cloning, Vol. II, Rickwood and Hames 
(eds.), irl Press, Oxford-Washington, D.C.) or a 
30 liposome-mediated procedure (Rose et al., 1991, Biotechnigues 
10:520-525). 

Where, as is preferred, flanking TK sequences are 
used to promote homologous recombination, the resulting 
recombinant viruses thus have a disrupted TK region, 
35 permitting them to grow on a TK - host cell line such as Rat2 
(ATCC Accession No. CRL 1764) in the presence of 
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identity and purity, and expression of the inserted 
polymerase. 

5.5. LARGE SCALE GROWTH AND PURIFICATION 
5 OF RECOM BINANT REPLICABLE VESICULOVIRUSES 

The recovered recombinant vesiculovirus, after' 
plague-purification, can then be grown to large numbers, by 
way of example, as follows. Virus from a single plague (~10 6 
pfu) is recovered and used to infect ~10 7 cells (e.g., BHK 
1Q cells), to yield, typically, 10 ml at a titer of 10 9 -10 !0 
pfu/ml for a total of approximately 10 11 pfu. Infection of 
-10 12 cells can then be carried out (with a multiplicity of 
infection of 0.1), and the cells can be grown in suspension 
culture, large dishes, or roller bottles by standard methods. 
15 It ^ s noted that recombinant vesiculoviruses which 

no longer express the extracellular region of the 
vesiculovirus G protein (which determine host range) and 
which, instead, express an envelope glycoprotein of a 
different virus will need to be grown in cells which are 
20 susceptible to infection by the different virus (and which 
cells thus express a receptor promoting infection by a virus 
expressing the envelope glycoprotein of the different virus) . 
Thus, for example, where the recombinant vesiculovirus 
expresses the HIV envelope glycoprotein, the virus is grown 
25 in CD4 + cells (e.g., CD4 + lymphoid cells). 

Virus for vaccine preparations can then be 
collected from culture supernatants , and the supernatants 
clarified to remove cellular debris. If desired, one method 
of isolating and concentrating the virus that can be employed 
30 is b y passage of the supernatant through a tangential flow 
membrane concentration. The harvest can be further reduced 
in volume by pelleting through a glycerol cushion and by 
concentration on a sucrose step gradient. An alternate 
method of concentration is affinity column purification 
35 (Daniel et al., 1988, Int. J. Cancer 41:601-608). However, 
other methods can also be used for purification (see, e.g., 
Arthur et al., 1986, J. Cell. Biochem. Suppl. 10A:226), and 
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any possible modifications of the above procedure will be 
readily recognized by one skilled in the art. Purification 
should be as gentle as possible, so as to maintain the 
integrity of the virus particle. 

5 

5.6. RECOMBINANT REPLI CABLE VESICULOVIRUSES 
FOR USE AS LIVE VACCINES 

In one embodiment of the invention, the recombinant 

replicable vesiculoviruses that express an immunogenic 

Antigen are used as live vaccines. 

The recombinant vesiculoviruses for use as 

therapeutic or prophylactic live vaccines according to the 

invention are preferably somewhat attenuated. Most available 

strains e.g., laboratory strains of VSV, may be sufficiently 

15 attenuated for use. Should additional attenuation be 

desired, e.g., based on pathogenicity testing in animals, 
attenuation is most preferably achieved simply by laboratory 
passage of the recombinant vesiculovirus (e.g., in BHK or any 
other suit ;L1& .ye 3 1 I 'ii.j). Generally, attenuated viruses are 

20 obtainable by numerous methods known in the art including but 
not limited to chemical mutagenesis, genetic insertion, 
deletion (Miller, 1972, Experiments in Molecular Genetics, 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) or 
recombination using recombinant DNA methodology (Maniatis et 

25 al., 1982, Molecular Cloning, A Laboratory Manual, Cold 

Spring Harbor Laboratory, Cold Spring Harbor, NY), laboratory 
selection of natural mutants, etc. 

In this embodiment of the invention, a vaccine is 
formulated in which the immunogen is one or several 

30 recombinant vesiculovirus (es) , in which the foreign RNA in 
the genome directs the production of an Antigen in a host so 
as to elicit an immune (humoral and/or cell mediated) 
response in the host that is. prophylactic or therapeutic. In 
an embodiment wherein the Antigen displays the antigenicity 

35 or immunogenicity of an antigen of a pathogen, administration 
of the vaccine is carried out to prevent or treat an 
infection by the pathogen and/or the resultant infectious 
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disorder and/or other undesirable correlates of infection. 
In an embodiment wherein the Antigen is a tumor antigen, 
administration of the vaccine is carried out to prevent or 
treat tumors {particularly, cancer) . 
5 In a preferred specific embodiment, the recombinant 

vesiculoviruses are administered prophylactically , to 
prevent /protect against infection and/or infectious diseases 
or tumor (e.g., cancer) formation. 

In a specific embodiment directed to therapeutics, 

10 the recombinant vesiculoviruses of the invention, encoding 
immunogenic epitope(s), are administered, therapeutically , for 
the treatment of infection or tumor formation. 
Administration of such viruses, e.g., to neonates and other 
human subjects, can be used as a method of immunostimulation, 

15 to boost the host's immune system, enhancing cell-mediated 
and/or humoral immunity, and facilitating the clearance of 
infectious agents or tumors. The viruses of the invention 
can be administered alone or in combination with other 
therapies (examples of anti-viral therapies, including but 

2 0 not limited to a-interferon and vidarabine phosphate; 
examples of tumor therapy including but not limited to 
radiation and cancer chemotherapy) . 

5.7. INACTIVATED RECOMBINANT 
25 VESICULOVIRUSES FOR VACCINE USE 

In a specific embodiment, the recombinant 
replicable vesiculoviruses of the invention are inactivated 
(i.e., killed, rendered nonreplicable) prior to vaccine use, 
to provide a killed vaccine. Since the vesiculovirus 

30 envelope is highly immunogenic, in an embodiment wherein one 
or more foreign proteins (e.g., an envelope glycoprotein of a 
virus other than a vesiculovirus) is incorporated into the 
vesiculovirus envelope, such a virus, even in killed form, 
can be effective to provide an immune response against said 

35 foreign protein(s) in a. host to which it is administered. In 
a specific embodiment, a multiplicity of Antigens, each 
displaying the immunogenic ity or antigenicity of an envelope 
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glycoprotein of a different virus, are present in the 
recombinant vesiculovirus particle. 

The inactivated recombinant viruses of the 
invention differ from defective interfering particles in 
5 that, prior to inactivation the virus is replicable (i.e., it 
encodes all the vesiculovirus proteins necessary to enable it 
to replicate in an infected cell). Thus, since the virus is 

« 

originally in a replicable state, it can be easily propagated 

and grown to large amounts prior to inactivation, to provide 
10 a large amount of killed virus for use in vaccines, or for 

purification of the expressed antigen for use in a subunit 

vaccine (see Section 5.8, infra). 

Various methods are known in the art and can be 

used to inactivate the recombinant replicable vesiculoviruses 
15 of the invention, for use as killed vaccines. Such methods 

include but are not limited to inactivation by use of 

formalin, betapropiolactone, gamma irradiation, and psoralen 

plus ultraviolet light. 

In a toptsciiic embodiment, recombinant v^iculrviius 
20 can be readily inactivated by resuspension of purified 

virions in a suitable concentration of formaldehyde. While 
0.8 formaldehyde may be sufficient, verification of the 
optimum concentration of formaldehyde can be readily 
determined for a particular virus by titration of serial 
25 dilutions of formaldehyde with infectious virus to determine 
the inactivation curve of formalin for that virus. This 
technique has been described in detail by Salk and Gori, 
I960, Ann. N.Y. Acad. Sci . 83 : 609-637 ) . By extrapolation to 
zero, the concentration expected to inactivate the last 
30 infectious particle can be estimated. By utilizing a 

substantially higher concentration, e.g., 4-fold greater than 
the estimated concentration, complete inactivation can be 
assured. 

Although formalin inactivation alone has proven to 
35 be effective, it may be desirable, for safety and regulatory 
purposes, to kill the virus twice or more, using one or more 
of the numerous other methods currently known for virus 
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15 



inactivation. Thus, although not essential, it is 
contemplated that the virus used in the final formulation 
will be often inactivated by a second agent after treatment 
with formalin. 

5.8'. USE OF RECOMBINANT REPLICABLE 
VESICULOVIRUSES IN THE 
PRODUCTION OF SUBUNIT VACCINES 

Since the recombinant vesiculoviruses of the 

invention can be propagated and grown to large amounts, where 

the recombinant vesiculoviruses express an Antigen, growth of 

such vesiculoviruses provides a method for large scale 

production and ready purification of the expressed Antigen, 

particularly when the Antigen is incorporated into the 

envelope of the recombinant vesiculovirus. In a specific 

embodiment, the Antigen is all or a portion of an envelope 

glycoprotein of another virus, e.g., HIV gpl60, expressed as 

a nonfusion protein, or expressed as a fusion to the 

cytoplasmic domain of a vesiculovirus G protein. 

The Antigens thus produced and purified have use in 

subunit vaccines. 

The recombinant vesiculoviruses that express an 

Antigen can also be used to recombinantly produce the Antigen 

in infected cells in vitro, to provide a source of 'Antigen 

for use in immunoassays , e .g. , to detect or measure in a 

sample of body fluid from a vaccinated subject the presence 

of antibodies to the Antigen, and thus to diagnose infection 

or the presence of a tumor and/or monitor immune response of 

the subject subsequent to vaccination. 

5.9. DETERMINATION OF VACCINE EFFICACY 
Immunopotency of the one or more Antigen (s) in its 
live or inactivated vesiculovirus vaccine formulation, or in 
is subunit vaccine formulation, can be determined by 
monitoring the immune response of test animals following 

35 

immunization with the recombinant vesiculovirus (es) 
expressing the Antigen (s) or with the subunit vaccine 
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containing the Antigen, by use of any immunoassay known in 
the art. Generation of a humoral (antibody) response and/or 
cell-mediated immunity, may be taken as an indication of an 
immune response. Test animals may include mice, hamsters, 
5 dogs, cats, monkeys, rabbits, chimpanzees, etc., and 
eventually human subjects. 

Methods of introduction of the vaccine may include 
oral, intracerebral, intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, intranasal or any 
10 other standard routes of immunization. The immune response 
of the test subjects can be analyzed by various approaches 
such as: the reactivity of the resultant immune serum to the 
Antigen, as assayed by known techniques, e.g., enzyme linked 
immunosorbent assay (ELISA) , immunoblots, 
15 radioimmunoprecipitations, etc.; or, in the case where the 
Antigen displays the antigenicity or immunogenicity of a 
pathogen's antigen, by protection of immunized hosts from 
infection by the pathogen and/ or attenuation of symptoms due 
to infection by tuc pdU*uyeri in immunized hosts; or, ±n the 
20 case where the antigen displays the antigenicity or 

immunogenicity of a tumor antigen, by prevention of tumor 
formation or prevention of metastasis, or by regression, or 
by inhibition of tumor progression, in immunized hosts. 

As one example of suitable animal testing of a live 

2 5 vaccine, live vaccines of the invention may be tested in 

rabbits for the ability to induce an antibody response to the 
Antigens. Male specific-pathogen-free (SPF) young adult New 
Zealand White rabbits may be used. The test group of rabbits 
each receives approximately 5 x 10 8 pfu (plague forming units) 

3 0 of the vaccine. A control group of rabbits receives an 

injection in 1 mM Tris-HCl pH 9.0 of a non-recombinant 
vesiculovirus or of a recombinant vesiculovirus which does 
not express the same Antigen. 

Blood samples may be drawn from the rabbits every 
35 one or two weeks, and serum analyzed for antibodies to the 
Antigen(s). The presence of antibodies specific for the 
Antigen(s) may be assayed, e.g., using an ELISA. 
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Animals may also be. used to test vaccine efficacy 
(e.g., challenge experiments). For example, in a specific 
embodiment regarding a live vaccine formulation, monkeys each 
receive intradermally approximately 5 x 10 8 pfu of recombinant 
5 vesiculovirus. A control monkey receives (control) non- 
recombinant virus intradermally. Blood is drawn weekly for 
12 weeks, and serum is analyzed for antibodies to the 
Antigen (s) . 



10 



5.10. VACCINE FORMULATION AND ADMINISTRATION 
The vaccines of the invention may be multivalent or 
univalent. Multivalent vaccines are made from recombinant 
viruses that direct the expression of more than one Antigen, 
from the same or different recombinant viruses. 
15 Man y methods may be used to introduce the vaccine 

formulations of the invention; these include but are not 
limited to oral, intradermal, intramuscular, intraperitoneal, 
intravenous, subcutaneous, intranasal routes, and via 
scarification (scratching through the top layers or skin, 
20 e.g., using a bifurcated needle). 

The patient to which the vaccine is administered is 
preferably a mammal, most preferably a human, but can also be 
a non-human animal including but not limited to cows, horses, 
sheep, pigs, fowl (e.g., chickens), goats, cats, dogs, 
25 -hamsters, mice and rats. In the use of a live vesiculovirus 
vaccine, the patient can be any animal in which vesiculovirus 
replicates (for example, the above-listed animals). 

The virus vaccine formulations of the invention 
comprise an effective immunizing amount of one or more 
30 recombinant vesiculoviruses (live or inactivated, as the case 
• may be) and a pharmaceutical^ acceptable carrier or 

excipient. Subunit vaccines comprise an effective immunizing 
amount of one or more Antigens and a pharmaceutical^ 
acceptable carrier or excipient. Pharmaceutical^ acceptable 
35 carriers are well known in the art and include but are not 
limited to saline, buffered saline, dextrose, water, 
glycerol, sterile isotonic aqueous buffer, and combinations 



- 55 - 



BNSDOCID: <WO 9634625A1_I_> 



WO 96/34625 PCT/US96/06053 



. thereof. One example of such an acceptable carrier is a 
physiologically balanced culture medium containing one or 
more stabilizing agents such as stabilized, hydrolyzed 
proteins, lactose, etc. The carrier is preferably sterile. 
5 The formulation should suit the mode of administration. 

The composition, if desired, can also contain minor 
amounts of wetting or emulsifying agents, or pH buffering 
agents. The composition can be a liquid solution, 
suspension, emulsion, tablet, pill, capsule, sustained 
10 release formulation, or powder. Oral formulation can include 
standard carriers such as pharmaceutical . grades of mannitol, 
lactose, starch, magnesium stearate, sodium saccharine, 
cellulose, magnesium carbonate, etc. 

Generally, the ingredients are supplied either 
15 separately or mixed together in unit dosage form, for 
example, as a dry lyophilized powder or water free 
concentrate in a hermetically sealed container such as an 
ampoule or sachette indicating the quantity of active agent. 
Where the composition is administered by injection, an 
20 ampoule of sterile diluent can be provided so that the 
ingredients may be mixed prior to administration. 

In a specific embodiment, a lyophilized recombinant 
vesiculovirus of the invention is provided in a first 
container; a second container comprises diluent consisting of 
25 an aqueous solution of 50% glycerin, 0.25% phenol, and an 
antiseptic (e.g., 0.005% brilliant green). 

The precise dose of virus, or subunit vaccine, to 
be employed in the formulation- will also depend on the route 
of administration, and the nature of the patient, and should 
30 be decided according to the judgment of the practitioner and 
each patients circumstances according to standard clinical 
techniques. An effective immunizing amount is that amount 
sufficient to produce an immune response to the Antigen in 
the host to which the recombinant vesiculovirus, or subunit 
35 vaccine, is administered. 

In a specific embodiment, an effective' immunizing 
amount of a live recombinant vesiculovirus of the present 
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invention is within the range of 10 3 to 10 9 pfu/dose, more 
preferably 10 6 to 10' pfu/dose. Boosting is possible but not 
preferred. If boosting is desired, one optionally may boost 
with the Antigen in purified form rather than using a 
5 recombinant vesiculovirus of the invention. 

For inactivated recombinant vesiculovirus vaccines, 
the vaccine formulation comprises an effective immunizing 
amount of the inactivated virus, preferably in combination 
with an immunostimulant ; and a pharmaceutical^ acceptable 
10 carrier. As used in the present context, "immunostimulant" 
is intended to encompass any compound or composition which 
has the ability to enhance the activity of the immune system, 
whether it be a specific potentiating effect in combination 
with a specific antigen, or simply an independent effect upon 
15 the activity of one or more elements of the immune response. 
Some of the more commonly utilized immunostimulant compounds 
in vaccine compositions are the adjuvants alum or muramyl 
dipeptide (MDP) and its analogues. Methods of utilizing 
these materials are known ; in the art, and it is well within 
20 the ability of the skilled artisan to determine an optimum 
amount of stimulant for a given virus vaccine. It may also 
be desired to use more than one immunostimulant in a given 
formulation. 

The exact amount of inactivated virus utilized in a 
25 given preparation is not critical, provided that the minimum 
amount of virus necessary to provoke an immune response is 
given. A dosage range of as little as about io jug, up to 
amount a milligram or more, is contemplated. As one example, 
in a specific embodiment, individual dosages may range from 
30 about 50-650 fig per immunization. 

Use of purified Antigens as subunit vaccines can be 
carried out by standard methods. For example, the purified 
protein (s) should be adjusted to an appropriate 
concentration, formulated with any suitable vaccine adjuvant 
35 and packaged for use. Suitable adjuvants may include, but 
are not limited to: mineral gels, e.g., aluminum hydroxide; 
surface active substances such as lysolecithin, pluronic 
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polyols; polyanions; peptides; oil emulsions; alum, and MDP. 
The immunogen may also be incorporated into liposomes, or 
conjugated to polysaccharides and/or other polymers for use 
in a vaccine formulation. In instances where the recombinant 
5 Antigen is a hapten, i.e., a molecule that is antigenic in 
that it can react selectively with cognate antibodies, but 
not immunogenic in that it cannot elicit an immune response, 
the hapten may be covalently bound to a carrier or 
immunogenic molecule; for instance, a large protein such as 
10 serum albumin will confer immunogenicity to the hapten 

coupled to it. The hapten-carrier may be formulated for use 
as a vaccine. 

Effective doses (immunizing amounts) of the 
vaccines of the invention may also be extrapolated from dose- 
15 response curves derived from animal model test systems. 

The invention also provides a pharmaceutical pack 
or kit comprising one or more containers comprising one or 
more of the ingredients of the vaccine formulations of the 
invention. Associated with such container(s) can bt «.'.r.;U«» 
2 0 in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

The present invention thus provides a method of 
25 immunizing an animal, or treating or preventing various 

diseases or disorders in an animal, comprising administering 
to the animal an effective immunizing dose of a vaccine of 
the present invention. 

30 5.11. USE OF ANTIBODIES GENERATED 

BY THE VACCTMPS OF THE TNVRNTTmr 

The antibodies generated against the Antigen by 
immunization with the recombinant viruses of the present 
invention also have potential uses in diagnostic 
35 immunoassays, passive immunotherapy, and generation of 
antiidiotypic antibodies. 
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The generated antibodies may be isolated by 
standard techniques known in the art (e.g., immunoaf f inity 
chromatography, centrif ligation, precipitation, etc.) and used 
in diagnostic immunoassays. The antibodies may also be used 
5 to monitor treatment and/or disease progression. Any 
immunoassay system known in the art, such as those listed 
supra, may be used for this purpose including but not limited 
to competitive and noncompetitive assay systems using 
techniques such as radioimmunoassays, ELISA (enzyme- linked 
10 immunosorbent assays), "sandwich" immunoassays, precipitin 
reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, complement- 
fixation assays, immunoradiometric assays, fluorescent 
immunoassays, protein A immunoassays and 
15 immunoelectrophoresis assays, to name but a few. 

The vaccine formulations of the present invention 
can also be used to produce antibodies for use in passive 
immunotherapy, in which short-term protection of a host is 
achieved by the administration- of pre-formed antibody 
20 directed against a heterologous organism. 

The antibodies generated by the vaccine 
formulations of the present invention can also be used in the 
production of antiidiotypic antibody. The antiidiotype 
antibody can then in turn be used for immunization, in order 
2 5 to produce a subpopulation of antibodies that bind the 

initial antigen of the pathogenic microorganism (Jerne, 1974, 
Ann. Immunol. (Paris) 125c:373; Jerne, et al., 1982, EMBO J. 
1:234) . 

30 6. RECOMBINANT VESICULAR STOMATITIS 

VIRUSES FROM DNA 

We assembled a DNA clone containing the 11,161 
nucleotide sequence of the prototype rhabdovirus , vesicular 
stomatitis virus (VSV) , such that it could be transcribed by 
35 the bacteriophage T7 RNA polymerase to yield a full-length 
positive strand RNA complementary to the VSV genome. 
Expression of this RNA in cells also expressing the VSV 
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nucleocapsid protein and the two VSV polymerase subunits 
resulted in production of VSV with the growth 
characteristics of wild-type VSV. Recovery of virus from DNA 
was verified by: 1) the presence of two genetic tags 
5 generating novel restriction sites in DNA derived from the 
genome; 2) direct sequencing of the genomic RNA of the 
recovered virus, and 3) , production of a VSV recombinant in 
which the glycoprotein was derived from a second serotype. 
The ability to generate VSV from DNA opens numerous 

10 possibilities for the genetic analysis of VSV replication. 

In addition, because VSV can be grown to very high titers and 
in large quantities with relative ease, one can genetically 
engineer recombinant VSVs displaying novel antigens. Such 
modified viruses can be used as vaccines conferring 

15 protection against other viruses or pathogenic 

microorganisms, or to produce immunity in general against an 
encoded foreign antigen. 

6.1. MATERIALS AND METHODS 

20 Plasmid Construction. The plasmid pVSVFL(+) 

expressing the 11,161 nucleotide positive strand 
(antigenomic) VSV RNA sequence was constructed from four DNA 
fragments cloned into pBluescript SK+ (Stratagene) . The 
starting plasmid for the construction, pVSVFL(-) , expressed 

25 the complete negative sense VSV genomic RNA (Indiana 

serotype) from a T7 promoter. This plasmid was generated in 
a nine step cloning procedure that involved joining the five 
original cDNA clones of the VSV mRNAs (Gall i one et al., 1981, 
J. Virol. 39:529-535; Rose, and Gallione, 1981, J. Virol. 

30 39:519-528; Schubert et al., 1985, Proc. Natl. Acad. Sci. USA 
82:7984-7988) with gene junction fragments and terminal 
fragments. These fragments were generated by reverse 
transcription and polymerase chain reaction (RT-PCR) (Mullis 
and Faloona, 1987, Methods in Enzymology 155:335-350) from 

35 VSV genomic RNA (M.A. Whitt, R. Burdine, E.A. Stillman and 
J.K. Rose, manuscript in preparation). To facilitate 
engineering of the VSV genome and to provide genetic tags, 
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unique Mlu I and Nhe I restriction enzyme sites were 
introduced by oligonucleotide-directed mutagenesis into the 
5' and 3' non-coding regions flanking the VSV glycoprotein 
gene prior to construction of the full length genome. 
5 In the initial step of constructing pVSVFL(+) we 

used the primers 

( 5 ' CCGGCTCGAGTTGTAATACGACTCACTATAf;r;r;Ar»r:^&r:&rLa Ziriv ^^nn^rprp^rprp^rp 
C-3') (SEQ ID NO: 38) and ( 5 ' GAACTCTCCTCTAGATGAGAAC-3 ' ) (SEQ 
ID NO: 39) to amplify (Mullis and Faloona, 1987, Methods in 

10 Enzymology 155:335-350) a 2,124 nucleotide fragment from 
pVSVFL(-) (# 1, Fig. 4A) . This fragment corresponds to the 
3' end of the VSV genome. The first primer introduced an Xho 
I site and a T7 promoter (underlined) immediately preceding 
the sequence complementary to the 3' end of the VSV genome. 

15 The second primer covered a unique Xba I site present in the 
VSV P gene. The PCR product was digested with Xho I and Xba 
I and cloned into pBluescript SK + (Stratagene) that had been 
digested with Xho I and Xba I. The resulting plasmid 
carrying the sequence corresponding to the 3' end of the VSV 

20 genome preceded by a T7 promoter was designated pBSXX. Note 
that an additional T7 promoter is also present upstream of 
the Xho I site in the vector. Next we generated the sequence 
corresponding to the 5' end of the VSV genome and part of the 
hepatitis delta virus (HDV) ribozyme (Pattnaik et al., 1992, 
25 cell 69:1011-1120; Perrotta and Been, 1991, Nature 

350:434-436). A 147 nucleotide PCR product (#3, Fig. 4A) was 
amplified from pVSVFL(-) with primers 

( 5 ' AGGTCGGACCGCGAGGAG GTGGAGATGCCATGCCGACCC ACGAA^ArPAPfl a&arpar. 
-3') (SEQ ID NO:40) and (5 ' ATGTTGAAGAGTGACCTACACG-3 ' ) (SEQ ID 

30 N0:41) . The first primer contained 39 nucleotides of the 
sequence encoding the HDV ribozyme (underlined) followed by 
19 nucleotides complementary to the 3' end of the VSV 
antigenomic RNA. The second primer hybridized within the L 
gene (Fig. 4A) . The PCR product was digested with Afl II and 

35 Rsr II and the. 80 nucleotide Afl II-Rsr II fragment was 

ligated to a 225 nucleotide Rsr II-Sac I. fragment (#4, Fig. 
4A) derived from a plasmid designated pBS-GMG (Stillman et 

- 61 - 



BNSDOCID: <WO 9634625A 1 _l_> 



WO 96/34625 



PCT/US96/06053 



al. , manuscript submitted). Fragment 4 contained -the T7 
terminator sequence and the remainder of the sequence 
encoding the HDV ribozyme. Ligated products were digested 
with Afl II and Sac I and the 305 nucleotide Afl II-Sac I 
5 product was cloned into the Afl II and Sac I sites of a 

modified pBSXX vector that contained an Afl II site inserted 
at the unique Not I site within the polylinker. This plasmid 
containing the Afl II-Sac I fragment was designated pBXXAS. 
To complete the construction, a 10,077 nucleotide Bst 1107 I 

10 to Afl II fragment (#2, Fig. 4A) containing 90% of the VSV 
sequences from pVSVFL(-) was inserted into the unique Bst 
1107 I and Aflll sites of pBXXAS. The final plasmid was 
designated pVSVFL(+) . The sequences in this plasmid 
generated by PCR (hatched sequences, Fig. 4B) were determined 

15 and contained no errors. We also prepared a plasmid in which 
the sequence of the VSV Indiana serotype G gene (Mlul-Nhel) 
was replaced with the G gene from the New Jersey serotype of 
VSV fGallione and Rose, 1983, J. Virol. 46:162-169). This 
pl«rr.ii is called pVSVFL (+) and has only a single T7 

20 promoter. 

Transfection and recovery of recombinant VSV. Baby 
hamster kidney cells (BHK-21, ATCC) were maintained in DME 
(Dulbecco's modified Eagle's medium) supplemented with 5% 
fetal bovine serum (FBS) . Cells on 10 cm dishes ("70% 

25 confluent) were infected at a multiplicity of 10 with VTF7-3 
(Fuerst et al., 1986, Proc. Natl. Acad. Sci. USA 
83:8122-8126). After 30 min, plasmids encoding the VSV 
antigenomic RNA and the N, P, and L proteins were tzransfected 
into the cells using a calcium phosphate transfection kit 

30 according to directions supplied (Stratagene) . The coding 
regions for N, P, and L proteins were each expressed in 
pBluescript SK(+) from the T7 promoter. Plasmid amounts were 
10 M9 pVSVFL( + ), 5 /xg pBS-N, 4 jug pBS-P, and 2 ixg pBS-L. 
After 24-48 h incubation at 37 °C in 3% C0 2 , cells were 

35 scraped from the dish and subjected to three rounds of 
freeze-thawing (-70°C, 37 °C) to release cell-associated 
virus. Debris was pelleted from the cell lysates by 
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centrifugation at 1,250 x g for 5 min. Five ml of this 
lysate was added to approximately 10 6 BHK cells on a 10 cm 
plate in 10 ml of DME + 5% FBS . After 48 h the medium was 
clarified by centrifugation at 1,250 x g for 10 min, and 
5 passed through a filter to remove the majority of the 

vaccinia virus (0,2 tim pore size, Gelman Sciences). One ml 
was then added directly to BHK cells that had been plated on 
a coverslip in a 35 mm dish. After four hours, the cells 
were fixed in 3% paraformaldehyde and stained with monoclonal 

10 antibody II to the VSV G, protein (Lefrancois and Lyles, 1982, 
Virology 121:168-174) or 9B5 (Bricker et al., 1987, Virology 
161:533-540) to the VSV G NJ protein followed by goat anti-mouse 
rhodamine conjugated antibody (Jackson Research) . Cells were 
then examined by indirect immunofluorescence using a Nikon 

15 Microphot-FX microscope equipped with a 4 Ox planapochromat 
objective. When VSV recovery was successful, 100% of the 
cells showed the typical bright stain for G protein 
characteristic of a VSV infection. 

Preparation and analysis' wf 7£V RKA and protein. 

2 0 Recombinant VSV and wild-type VSV isolated from single 
plaques (~10 5 plaque forming units) were used to infect a 
monolayer of BHK cells {-80% confluent) on a 10 cm dish in 
10 ml DME plus 5% FBS. After 24 h, cell debris and nuclei 
were removed by centrifugation at 1,250 x g for 5 min, and 

25 virus was then pelleted from the medium at 35,000 RPM in a 
Beckman SW41 rotor for one hour. Virus pellets were 
resuspended in 0.5 ml 10 mM Tris-HCl, pH 7.4 for protein 
analysis. For RNA isolation, virus was resuspended in 0.2 ml 
of 0.5% SDS/0.2M sodium acetate, pH 8.0, followed by 

30 extraction with phenol/CHCl 3 . RNA was precipitated with 95% 
ethanol and 5 Atg carrier tRNA. RNA was pelleted by 
centrifugation at 12,000 x g for 15 min and resuspended in 
water with 1 unit RNasin (Promega) . For analysis of RNA by 
RT-PCR, primer pairs flanking either the novel Nhe I or Mlu I 

.35 sites were used. The first strand DNA synthesis reaction was 
carried out in 50 pi of PCR buffer (Promega) containing 5 mM 
MgCl 2 , 1 mM dNTPs, 1 unit RNAs in (Promega) , 1 unit avian 
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myeloblastosis virus reverse transcriptase (AMV RT; Promega} 
0.75 /iM primer and approximately 0.25 fig of VSV genomic RNA. 
Incubation was at 42 °C for 15 min followed by 5 min at 99 °C 
and 5 min at 5°C. PCR was carried out by addition of 0.5 U 
5 Taq polymerase, adjustment of MgCl 2 concentration to 1.25 mM, 
and addition of the second primer (0.75 MM) . The reaction 
was subjected to 20 thermal cycles: 95°C, 1 min; 60°C 1.5 * 
min. The reaction was then incubated at 60°C for 7 min. 

Direct sequencing of VSV genomic RNA was performed 
10 according to a previously described protocol based on the 
dideoxy chain termination method (Mierendorf and Pfeffer, 
1987, Methods in Enzymology 152:563-566) except that 
[Qr- 33 P]dATP (Amersham, Inc.) was used. Each reaction included 
approximately 0.25 /ig of VSV genomic RNA. 

15 

6.2. RESULTS 
To construct a cDNA clone encoding the entire 
11,161 VSV genome, individual cDNA clones of the VSV mRNAs 
wer<- ir.itizlxy joined using small DNA fragments generated by 

20 RT-PCR that covered the four gene junctions. Correct genomic 
terminal sequences were also generated by RT-PCR' of the VSV 
genome, and these were joined to the other DNAs using 
restriction sites. This initial clone was constructed with a 
T7 promoter directing synthesis of the full-length negative 

25 strand VSV RNA. Despite numerous attempts, we were unable to 
recover VSV from cells expressing the VSV genomic RNA and the 
VSV N, P, and L proteins. The VSV constructed was thus 
redesigned to express the VSV antigenomic DNA. The 
construction strategy is described in Materials and Methods 

30 and in Fig. 4A-B. The entire VSV sequence as well as a T7 
promoter, terminator and HDV ribozyme sequence were cloned in 
pBluescript SK+ between the Xho I and Sac I sites (Fig. 4B; 
Fig. 1) . An additional T7 promoter is also present upstream 
of the Xho I site in the plasmid. A slightly different 

35 cloning strategy was used to generate plasmids lacking the 
upstream T7 promoter and VSV has also been recovered from 
these constructs. 
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Recovery of VSV from dna . To determine if we could 
recover VSV from plasmid DNA, we infected cells with vaccinia 
VTF7-3 (Fuerst et al. , 1986, Proc. Natl. Acad. Sci. USA 
83:8122-8126) to provide cytoplasmic T7 RNA polymerase. 
5 These cells were then transfected with pVSVFL(+) , which 
expresses the antigenomic VSV RNA from a T7 promoter, and 
three other plasmids which express the VSV N, P, and L 
proteins. Expression of the N protein was required to 
assemble nascent VSV antigenomic RNA into nucleocapsids . 
10 Once formed, these nucleocapsids should serve as templates 
for synthesis of minus strand RNA by the L/P polymerase 
complex. Encapsidated minus strand RNA should then be a 
template for transcription, initiating the VSV infectious 
cycle. 

15 The initial recovery experiment employed two 10 cm 

plates of BHK cells (-5 x 10 6 cells each). At 24 hours after 
the infection with VTF7-3 and transfection with the four 
plasmids, cells and medium were frozen and thawed to release 
any cell-associated VSV, and the ciatific^ iysates were added 

20 to fresh BHK cells. After 48 hours, both plates showed 

severe cytopathic effects that could have been due either to 
vaccinia virus or to recovered VSV. One ml of each 
supernatant was then added to small dishes of BHK cells on 
coverslips. After two hours, one of these coverslips showed 

25 rounded cells characteristic of a VSV infection, while the 
other did not. After 4 hours, cells on both coverslips were 
fixed, stained with appropriate antibodies, and examined by 
indirect immunofluorescence microscopy to detect the VSV G 
protein. All cells on the coverslip showing rounded cells 

3 0 revealed intense fluorescence characteristic of G protein 

expression during VSV infection (data not shown) . Subsequent 
passaging and analysis described below showed that VSV had 
been recovered from the transfection. The other coverslip 
showed no G expression, and no VSV could be recovered after. 

35 passaging. 

Based on the frequency with which rabies virus 
(Schnell et al., 1994, EMBO J. 13:4195-4203) and VSV 
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minigenomes (Stillman et al. , manuscript submitted) were 
recovered, we anticipated that recovery of complete VSV, if 
obtainable, would be a rare event. The initial recovery of 
VSV from only one of two transf ections suggested the 
5 possibility that the initial titer in the positive lysate was 
very low. To examine this titer, we infected BHK cells on 
coverslips with one tenth of the lysate (1 ml) derived from 
each initial transf ection. After eight hours, the cells were 
examined for expression of G protein by indirect 

10 immunofluorescence. A scan of the entire coverslip revealed 
no VSV infection from the negative lysate, and only five 
small areas of infection (2-6 cells each) from the lysate 
that gave rise to VSV G expression on subsequent passaging. 
The initial titer was therefore very low as we suspected, and 

15 likely represented a total of about 50 infectious particles, 
probably derived from a VSV infection initiated in only one 
cell out of 2 x 10 7 transf ected. This low rate of recovery of 
infectious VSV is typical of that observed in several 
experiment z \ -■■ - • 

20 Analysis of viral proteins. Subsequent passages 

and plaque assays of VSV recovered in three independent 
experiments revealed plaques that were detectable in less 
than 16 hours and titers up to 2 x 10 9 pfu/ml characteristic 
of VSV. For further verification that VSV had been 

25 recovered, the proteins in virus pelleted from the medium 
were examined by SDS polyacrylamide gel electrophoresis 
(PAGE) . Fig. 5 shows the Coomassie stained gel of proteins 
of VSV recovered from recombinant DNA (rVSV) and wildtype 
VSV. The mobilities and relative amounts of the five viral 

30 proteins were indistinguishable in the wildtype and 
recombinant virus. 

Identification of sequence tags. In pVSVFL(+), the 
VSV nucleotide sequence was altered by oligonucleotide- 
directed mutagenesis to generate unique Mlu I and Nhe I 

35 restriction enzyme sites in the 5' and 3' non-coding regions 
of the glycoprotein gene. To verify that these sites were 
present in recovered virus, we carried out reverse 
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transcription of genomic RNA purified from wild-type or 
recombinant virions using primers upstream of each 
restriction site. The reverse transcription products were 
then amplified by PCR using an additional primer downstream 
5 of each restriction site. The presence of the genetic tag in 
the recombinant virus was verified by digestion of the PCR 
products with the appropriate restriction enzymes. Using 
this method, the presence of both the Mlu I and Nhe I 
sequences in the recovered virus RNA was verified, and the 

10 results for the Nhe I site are shown in Fig. 6. Sequences 
from wild-type VSV and recombinant VSV were amplified in 
parallel and a 620 nucleotide fragment was obtained in both 
cases (lanes 3 and 5) . No product was obtained when reverse 
transcriptase was omitted from the reactions prior to PCR 

15 (lanes 1 and 2), indicating that the PCR product was derived 
from RNA, not from contaminating DNA. After digestion with 
Nhe I, expected fragments of 273 and 34 7 base pairs were 
obtained from recombinant VSV RNA, while the DNA derived from 
the wildtype KNA remained undigested \la.;:zz - A -and 6) . 

2 0 Direct sequencing of tagged genomic RNA. The 

presence of new restriction sites in the DNA generated by PCR 
provided strong evidence that VSV had been recovered from 
DNA. To ensure that identification of the genetic tags by 
PCR had not resulted from inadvertent contamination by 

25 plasmid DNA, we carried out direct sequence analysis of the 
genomic RNA using reverse transcriptase and a primer 
hybridizing upstream of the Nhe I site. The sequence from 
the autoradiogram shown in Fig. 7 is in exact agreement with . 
the published sequence of the VSV G mRNA (Rose and Gallione, 

30 1981, J. Virol. 39:519-528) except that the four nucleotide 
changes used to generate the Nhe I site (GCACAA to GCTAGC) 
are present. These results show unequivocally that the 
sequence tag is present in the genomic RNA. 

Recombinant VSV Indiana virus carrying the 

35 glycoprotein of the New Jersey serotype. There are two 
serotypes of VSV designated Indiana and New Jersey. The 
glycoproteins of the two serotypes share approximately 50% 
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sequence identity (Gallione and Rose, 1983, J. Virol. 
46:162-169). In earlier studies we found that the 
glycoprotein of the New Jersey serotype could complement a 
mutant of the VSVj serotype that makes a defective 
5 glycoprotein (Whitt et al., 1989, J. Virol. 63:3569-357,8). 
It therefore seemed likely that a recombinant VSV in which 
the Indiana glycoprotein (G,) gene was replaced by the New 
Jersey glycoprotein (G NJ ) gene would be viable despite the 
extensive sequence divergence. To generate such a 

10 recombinant, the G NJ cDNA was amplified by PCR using primers 
that introduced Mlu I and Nhe I sites within the 5' and 3' 
non-coding regions at each end of the gene. The amplified 
DNA was cloned into pBluescript and the G NJ protein was 
expressed in BHK cells using the vaccinia-T7 system. The 

15 protein expressed was shown to have membrane fusion activity 
below pH 6.0 indicating that it was functional (data not 
shown). This G NJ cDNA was then cloned into the unique Mlu I 
and Nhe I sites of the full-length construct after removal of 
sequences enbcdi:^ G, Recombinant VSV was recovered 

20 essentially as described above except that the initial 

transf ection was allowed to proceed for 48 hours before the 
freeze-thaw step. After the first passage, expression of the 
G NJ protein was verified by indirect immunofluorescence using 
a monoclonal antibody specific to G NJ (Bricker et al., 1987, 

25 Virology 161:533-540). The virus was then plaque purified 
and grown. To examine the proteins present in the 
recombinant virus, virus recovered from cells infected with 
VSVj , VSV NJ , and the recombinant VSV m}C was analyzed by SDS- 
PAGE followed by Coomassie staining. The VSV, G, N, P, and M 

30 proteins each have mobilities distinct from their VSV NJ 

counterparts (Fig. 8, lanes 1 and 3). The recombinant VSV^^ 
shows the mobility difference in only the G protein as 
. expected (lane 2). The presence of the novel Nhe I and Mlu I 
sites in the recombinant was also verified (data not shown) . 

35 
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6*3. DISCUSSION 
The results presented here establish that 
infectious VSV can be recovered from recombinant DNA. We 
believe that expressing the positive strand, antigenomic RNA 
5 in the presence of the N, P and L proteins was critical to 
our success because we have not recovered virus starting with 
an equivalent construct encoding the genomic RNA. 

Why is the initial event of generating VSV so rare, 
apparently occurring in only 1 in 10 7 to 10 8 transfected cells? 
10 One possibility is that our clone contains a sequence error 
that is only corrected by a rare mutational event. We 
believe this is not the case because the clone was completely 
sequenced prior to assembly and differences from published 
sequences were corrected, or the proteins were shown to be 
15 functional in complementation assays. Also, the frequency of 
recovery is actually higher than expected based on our 
observations with minigenomes encoding one or two VSV 
proteins (Stillman et al., manuscript submitted). m these 
cases we found that a transcribing and replicating minigenome 

2 0 (~2kb RNA) was recovered in about 1 in 10 2 transfected cells 

expressing the RNA with the N,P and L proteins. Addition of 
a second cistron (0.85 kb additional RNA) encoding the M 
protein dropped the recovery rate to approximately l in 10 3 
transfected cells. If there is a ten-fold drop in recovery 
25 rate for each additional kilobase of RNA added, one can 

easily rationalize an even lower frequency of recovery for 
the 11, 161 kb genome than we observed. Although these 
minigenomes encode negative sense RNAs, the comparison of the 
frequency of recovery to that of the full length plus 

3 0 construct is probably valid because expression of the N, P 

and L mRNAs would not generate mRNAs complementary to the 
minigenome. 

Although the rate limiting step in generation of 
infectious VSV is not known, it is likely to be at the level 
*5 of synthesis and encapsidation of the large antigenomic RNA, 
which must occur prior to replication and transcription. The 
complete encapsidation with N protein probably has to occur 
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on the nascent RNA to protect it from degradation, and the 
cells in which this occurs must also produce appropriate 
amounts of L and P proteins to initiate replication. Once 
this has occurred, however, the transcription and translation 
S of the genome should generate additional N, P, and L proteins 
as well as the G and M proteins required for budding of 
infectious virus. 

The recovery of VSV from DNA opens numerous aspects 
of the viral life cycle to genetic analysis. The studies of 
10 the genetic signals involved in transcription and replication 
have so far been confined to analysis of defective RNAs that 
do not encode viral proteins (Pattnaik et al., 1992, Cell 
69:1011-1120; Wertz et al. , 1994, Proc. Natl. Acad. 'sci. USA 
91:8587-8591). These and other signals can be now examined 
15 in the context of a VSV infection occurring in the absence of 
a vaccinia virus infection. The system we have described 
also provides an opportunity to study the roles of individual 
viral protein domains and modifications in viral assembly and 
replication., Hb^y these analyses have been confined to 
20 in vitro systems or to analysis employing the complementation 
of naturally occurring mutants where synthesis of the mutant 
protein can complicate the analysis. 

Perhaps even more exciting is the ability to use 
VSV as a vector to express other proteins. The experiment in 
25 which we recovered VSV Indiana carrying the glycoprotein from 
the New Jersey serotype (Fig. 8) illustrates that viable 
recombinants can be made. For reasons that are unclear the 
titers of recombinant virus were at least ten-fold lower than 
those obtained with either parent. The lower titer apparently 
30 did not result from a defect in viral assembly because the 
amounts of proteins in wildtype and recombinant virions at 
the end of the infection were comparable (Fig. 8) . Our 
previous experiments showed that a foreign glycoprotein 
carrying the appropriate cytoplasmic tail signal could be 
35 incorporated into the VSV envelope (Owens and Rose, 1993, j. 
Virol. 67:360-365). This suggests that one may generate' 
recombinant VSVs carrying novel proteins in their envelopes. 
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If these were appropriately attenuated, they can be used as 
vaccines against other viral diseases. 

The truncated genomes of defective interfering 
particles are replicated and packaged very well, thus we 
5 suspect that there will be flexibility in the maximum length 
of the genome that can be packaged as well. Presumably a 
longer nucleocapsid can be packaged as a longer bullet-shaped 
particle. Because of the modular nature of the VSV genome, 
with conserved gene end and start seguences at the gene 
10 junctions (Rose and Schubert, 1987, in The Viruses: The 

Rhabdoviruses, Plenum Publishing Corp., NY, pp. 129-166), it 
should be relatively easy to engineer additional genes into 
VSV. 

15 1 ■ DEPOSIT OF. MICROORGANISMS 

Plasmid pVSVFL(+) was deposited on May 2, 1995 with 
the American Type Culture Collection (ATCC) , 1201 Parklawn 
Drive, Rockville, Maryland 20852, under the provisions of the 
Budapest xi-aaty on the International Recognition of the 
20 Deposit of Microorganisms for the Purposes of Patent 
Procedures, and assigned accession no. 97134. 

The present invention is not to be limited in scope 
by the microorganism deposited or the specific embodiments 

25 described herein. Indeed, various modifications of the 

invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are 
intended to fall within the scope of the appended claims. 

30 Various publications are cited herein, the 

disclosures of which are incorporated by reference in their 
entireties. 



35 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Rose, John K. 

(ii) TITLE OF INVENTION: RECOMBINANT VESICULOVIRUSES AND THEIR 
USES 

(iii) NUMBER OF SEQUENCES: 41 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS 

(B) STREET: 115 5 Avenue of the Americas 
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(F) ZIP: 10036-2711 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1. 30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: On Even Date Herewith 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION* 

(A) NAME: Misrock, S. Leslie 

(B) REGISTRATION NUMBtK 18,8 72 

(C) REFERENCE /DdCrvtii .NUMBER : 6523-009-228 

(ix) TELECOMMUNICATION INFORMATION ■ 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 86 9-9741/8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO : 1 : 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 14311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 760., 2025 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2092.. 2886 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2946.. 3632 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(E) LOCATION: 3774.. 5306 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 5429.. 11755 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CACCTAAATT GTAAGCGTTA ATATTTTGTT AAAATTCGCG TTAAATTTTT GTTAAATCAG 
CTCATTTTTT AACCAATAGG CCGAAATCGG CAAAATCCCT TATAAATCAA AAGAATAGAC 
CGAGATAGGG TTGAGTGTTG TTCCAGTTTG GAACAAGAGT CCACTATTAA AGAACGTGGA 
CTCCAACGTC AAAGGGCGAA AAACCGTCTA TCAGGGCGAT GGCCCACTAC GTGAACCATC 
ACCCTAATCA AGTTTTTTGG GGTCGAGGTG CCGTAAAGCA CTAAATCGGA ACCCTAAAGG 
GAGCCCCCGA TTTAGAGCTT GACGGGGAAA GCCGGCGAAC GTGGCGAGAA AGGAAGGGAA 
GAAAGCGAAA GGAGCGGGCG CTAGGGCGCT GGCAAGTGTA GCGGTCACGC TGCGCGTAAC 
CACCACACCC GCCGCGCTTA ATGCGCCGCT ACAGGGCGCG TCCCATTCGC CATTCAGGCT 
GCGCAACTGT TGGGAAGGGC GATCGGTGCG GGCCTCTTCG CTATTACGCC AGCTGGCGAA 
AGGGGGATGT GCTGCAAGGC GATTAAGTTG GGTAACGCCA GGGTTTTCCC AGTCACGACG 
TTGTAAAACG ACGGCCAGTG AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
GCCCCCCCTC GAGTTGTAAT ACGACTCACT ATAGGGACGA AGACAAACAA ACCATTATTA 
TCATTAAAAG GCTCAGGAGA AACTTTAACA GTkATCAAA ATG TCT ^Wfw 



Met Ser Val Thr Val 
1 5 



AAG AGA ATC ATT GAC AAC ACA GTC ATA GTT CCA AAA CTT CCT rra 
Lys Arg lie lie Asp Asn Thr Val lie Val Pro 2£ £J Pro Ala £1 

15 20 

GAG GAT CCA GTG GAA TAC CCG GCA GAT TAC TTC AGA AAA TCA AAG GAP 
Glu Asp Pro Val Glu Tyr Pro Ala Asp Tyr Phe Art Lys Ser Lys G^u 

33 



ATT CCT CTT TAC ATC AAT ACT ACA AAA AGT TTG TCA GAT CTA AGA GGA 
He Pro Leu Tyr lie Asn Thr Thr Lys Ser Leu Ser A^p £u Arg Gly 
40 45 50 

TAT GTC TAC CAA GGC CTC AAA TCC GGA AAT GTA TCA ATC ATA CAT GTC 
Tyr val Tyr Gin Gly Leu Lys Ser Gly Asn Val Ser 111 lie £s vll 

60 65 

AAC AGC TAC TTG TAT GGA GCA TTA AAG GAC ATC CGG GGT AAG TTG GAT 
Asn Ser Tyr Leu Tyr Gly Ala Leu Lys Asp lie Arg Gly £s Su Sp 

75 80 85 

AAA GAT TGG TCA AGT TTC GGA ATA AAC ATC GGG AAA Gr* rrr r-*-v 

Lys Asp Trp Ser Ser Phe Gly lie Asn lie Gly L^ £a g Sp Thr 

95 100 

ATC GGA ATA TTT GAC CTT GTA TCC TTG AAA GCC CTG GAC GGC GTA CTT 
He. Gly lie Phe Asp Leu Val Ser Leu Lys Ala Leu A^p Gly Val Leu 

105 110 13 _5 

CCA GAT GGA GTA TCG GAT GCT TCC AGA ACC AGC GCA GAT GAC AAA T-G 
Pro Asp Gly Val Ser Asp Ala Ser Arg Thr Ser Ala Asp Asp Tr? 
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1206 



1254 



1350 



120 125 130 

TTQ t ^ MA QGC AGA ACA CAA ATG 

Leu Pro Leu Tyr Leu Leu Gly Leu Tyr Arg Val Gly Arg Thr Gin Met ' 
13 5 140 145 

CCT GAA TAC AGA AAA AAG CTC ATG GAT GGG CTG ACA AAT CAA TGC AAA 
Pro Glu Tyr Arg Lys Lys Leu Met Asp Gly Leu Thr Asn Gin Cys Lys 
150 155 .160 i 6 5 

ATG ATC AAT GAA CAG TTT GAA CCT CTT GTG CCA GAA GGT CGT GAC ATT 13 02 

Met He Asn Glu Gin. Phe Glu Pro Leu Val Pro Glu Gly Arg Asp He 

170 175 . 180 

TTT GAT GTG TGG GGA AAT GAC AGT AAT TAC ACA AAA ATT GTC GCT GCA 
Phe Asp Val Trp Gly Asn Asp Ser Asn Tyr Thr Lys He Val Ala Ala 

185 190 195 

GTG GAC ATG TTC TTC CAC ATG TTC AAA AAA CAT GAA TGT GCC TCG TTC 
Val Asp Met Phe Phe His Met Phe Lys Lys His Glu Cys Ala Ser Phe 
200 205 210 

AGA TAC GGA ACT ATT GTT TCC AGA TTC AAA GAT TGT GCT GCA ^TG G CA 1446 
Arg Tyr Gly Thr He Val Ser Arg Phe -Lys Asp Cys Ala Ala Leu Ala 
215 220 225 

ACA TTT GGA CAC CTC TGC AAA ATA ACC GGA ATG TCT ACA GAA GAT GTA 
Thr Phe Gly His Leu Cys Lys He Thr Gly Met Ser Thr Glu Asp Val ■ 
230 235 240 245 

ACG ACC TGG ATC TTG AAC CGA GAA GTT GCA GAT GAA ATG GTC CAA ATG 
Thr Thr Trp He Leu Asn Arg Glu Val Ala Asp Glu Met Val Gin Met 

250 255 260 



1398 



1494 



1542 



.S90 



1686 



1734 



1782 



ATG CTT CCA GGC CAA CiAA ATT GAC AAG GCC GAT TCA TAC ATG CCT TAT 
Met Leu Pro Gly Gin Glu lie Asp Lys Ala Asp Ser Tyr Met Pro Tyr 

265 270 * 275 

TTG ATC GAC TTT GGA TTG TCT TCT AAG TCT CCA TAT TCT TCC GTC AAA 163S 
Leu He Asp Phe Gly Leu Ser Ser Lys Ser Pro Tyr Ser Ser Val Lys 
280 285 2 90 . 

AAC CCT GCC TTC CAC TTC TGG GGG CAA TTG ACA GCT CTT CTG CTC AGA 
Asn Pro Ala Phe His Phe Trp Gly Gin Leu Thr Ala Leu Leu Leu Ara 
295 300 305 

TCC ACC AGA GCA AGG AAT GCC CGA CAG CCT GAT GAC ATT GAG TAT ACA 
Ser Thr Arg Ala Arg Asn Ala Arg Gin Pro Asp Asp He Glu Tyr Thr 
310 3 15 320 325 

TCT CTT ACT ACA GCA GGT TTG TTG TAC GCT TAT GCA GTA GGA TCC TCT 
Ser Leu Thr Thr Ala Gly Leu Leu Tyr Ala Tyr Ala Val Gly Ser Ser 

330 335 340 

GCC GAC TTG GCA CAA CAG TTT TGT GTT GGA GAT AAC AAA TAC ACT CCA 
Ala Asp Leu Ala Gin Gin Phe Cys Val Gly Asp Asn Lys Tyr Thr Pro 

345 350 ' 355 

GAT GAT AGT ACC GGA GGA TTG ACG ACT AAT GCA CCG CCA CAA GGC AGA 
Asp Asp Ser Thr Gly Gly Leu Thr Thr Asn Ala Pro Pro Gin Gly Ara 
360 365 370 

GAT GTG GTC GAA TGG CTC GGA TGG TTT GAA GAT CAA AAC AGA AAA CCG 
Asp Val Val Glu Trp. Leu Gly Trp Phe Glu Asp Gin Asn Arg Lys P-o 
3 75 380 385 Y 

ACT CCT GAT ATG ATG CAG TAT GCG AAA AGA GCA GTC ATG TCA CTG CAA 
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Thr Pro Asp Met Met Gin Tyr Ala Lys Arg Ala Val Met Ser Leu Gin 
390 395 400 4 05 

GGC CTA AGA GAG AAG ACA ATT GGC AAG TAT GCT ' AAG TCA GAA TTT GAC 2022 
Gly Leu Arg Glu Lys Thr He Gly Lys Tyr Ala Lys Ser Glu Phe Asp 

410 415 " 420 

AAA TGA CCCTATAATT CTCAGATCAC CTATTATATA TTATG CTACA TATGAAAAAA 2078 
Lys 

ACTAACAGAT ATC ATG GAT AAT CTC ACA AAA GTT CGT GAG TAT CTC AAG 2127 

Met Asp Asn Leu Thr Lys Val Arg Glu Tyr Leu Lys 

TCC TAT TCT CGT CTG GAT CAG GCG GTA GGA GAG ATA GAT GAG ATC GAA 2175 
Ser Tyr Ser Arg Leu Asp Gin Ala Val Gly Glu He Asp Glu He Glu 
15 20 . 25 

GCA CAA CGA GCT GAA AAG TCC AAT TAT GAG TTG TTC CAA GAG GAT GGA 2223 
Ala Gin Arg Ala Glu Lys Ser Asn Tyr Glu Leu Phe Gin Glu Asp Glv 
30 35 40 

GTG GAA GAG CAT ACT AAG CCC TCT TAT TTT CAG GCA GCA GAT GAT TCT 22 71 

Val Glu Glu His Thr Lys Pro Ser Tyr Phe Gin Ala Ala Asp Asp Ser 
45 50 55 60 

GAC ACA GAA TCT GAA CCA GAA ATT GAA GAC AAT CAA GGT TTG TAT GCA 2319 
Asp Thr Glu Ser Glu Pro Glu He Glu Asp Asn Gin Gly Leu Tyr Ala 

65 70 75 

CCA GAT CCA GAA GCT GAG CAA GTT GAA GGC TTT ATA CAG GGG CCT TTA 2367 
Pro Asp Pro Glu Ala Glu Gin Val Glu Gly Phe He Gin Gly Pro Leu 

80 85 90 

GAT GAC TAT GCA GAT GAG GAA GTG GAT GTT GTA TTT ACT TCG GAC TGG 2415 
Asp Asp Tyr Ala Asp Glu Glu Val Asp Val Val Phe Thr Ser Asp Trp 
95 100 105 

AAA CAG CCT GAG CTT GAA TCT GAC GAG CAT GGA AAG ACC- TTA CGG TTG 2463 
Lys Gin Pro Glu Leu Glu Ser Asp Glu His Gly Lys Thr Leu Arq Leu 
HO 115 120 

ACA TCG CCA GAG GGT TTA AGT GGA GAG CAG AAA TCC CAG TGG CTT TCG 2511 
Thr ser Pro Glu Gly Leu Ser Gly Glu Gin Lys Ser Gin Trp Leu Ser 
125 130 ■ 135 140 

ACG ATT AAA GCA GTC GTG CAA AGT GCC AAA TAC TGG AAT CTG GCA GAG 2 5 59 

Thr He Lys Ala Val Val Gin Ser Ala Lys Tyr Trp Asn Leu Ala Glu 

145 150 155 

TGC ACA TTT GAA GCA TCG GGA GAA GGG GTC ATT ATG AAG GAG CGC CAG 2607 
Cys Thr Phe Glu Ala Ser Gly Glu Gly Val He Met Lys Glu Arq Gin 

160 165 170 

ATA ACT CCG GAT GTA TAT AAG GTC ACT CCA GTG ATG AAC ACA CAT CCG 2655 
He Thr Pro Asp Val Tyr Lys Val Thr Pro Val Met Asn Thr His Pro ^ 
175 180 185 

TCC CAA TCA GAA GCA GTA TCA GAT GTT TGG TCT CTC TCA AAG ACA TCC 
Ser Gin Ser Glu Ala Val Ser Asp Val. Trp Ser Leu Ser Lys Thr Ser 
190 195 200 

ATG ACT TTC CAA CCC AAG AAA GCA AGT CTT CAG CCT CTC ACC ATA TCC 
Met Thr Phe Gin Pro Lys Lys Ala Ser Leu Gin Pro Leu Thr He Ser 
205 210 215 220 
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TTG GAT GAA TTG TTC TCA TCT AGA GGA GAG TTC ATC TCT GTC GGA GGT 2799 
Leu Asp Glu Leu Phe Ser Ser Arg Gly Glu Phe lie Ser Val Gly Gly 

225 230 235 

GAC GGA CGA ATG TCT CAT AAA GAG GCC ATC CTG CTC GGC CTG AGA TAC 2847 
Asp Gly Arg Met Ser His Lys Glu Ala lie Leu Leu Gly Leu Arg Tyr 

240 245 250 

AAA AAG TTG TAC AAT CAG GCG AGA GTC AAA TAT TCT CTG TAG 2889 
Lys Lys Leu Tyr Asn Gin Ala Arg Val Lys Tyr Ser Leu 
255 260 ^ 265 

ACTATGAAAA AAAGTAACAG ATATCACGAT CTAAGTGTTA TCCCAATCCA TTCATC 294 5 

ATG AGT TCC TTA AAG AAG ATT CTC GGT CTG AAG GGG AAA GGT AAG AAA 2 993 

Met Ser Ser Leu Lys Lys He Leu Gly Leu Lys Gly Lys Gly Lys Lys 
1 5 io " 15 

TCT AAG AAA TTA GGG ATC GCA CCA CCC CCT TAT GAA GAG GAC ACT AGC 3041 
Ser Lys Lys Leu Gly He Ala Pro Pro Pro Tyr Glu Glu Asp Thr Ser 

20 25 30 

ATG GAG TAT GCT CCG AGC GCT CCA ATT GAC AAA TCC TAT TTT GGA GTT '3089 
Met Glu Tyr Ala Pro Ser Ala Pro He Asp Lys Ser Tyr Phe Gly Val 
35 40 " 45 

GAC GAG ATG GAC ACC TAT GAT CCG AAT CAA TTA AGA TAT GAG AAA TTC 313 7 

Asp Glu Met Asp Thr Tyr Asp Pro Asn Gin Leu Arg Tyr Glu Lys Phe 
50 55 60 

TTC TTT ACA GTG AAA ATG ACG GTT AGA TCT AAT CGT CCG TTC AGA ACA 3185 
Phe Phe Thr Val Lys Met Thr Val Arg Ser Asn Arg Pro Phe Arg Thr 
65 70 75 *~ 80 

TAC TCA GAT GTG GCA GCC GCT GTA TCC CAT TGG GAT CAC ATG TAC ATC 2233 
Tyr Ser Asp Val Ala Ala Ala Val Ser His Trp Asp His Met Tyr He 

85 90 95 

GGA ATG GCA GGG AAA CGT CCC TTC TAC AAA ATC TTG GCT TTT TTG GGT 3281 
Gly Met Ala Gly Lys Arg Pro Phe Tyr Lys He Leu Ala Phe Leu Gly 

100 105 no 

TCT TCT AAT CTA AAG GCC ACT CCA GCG GTA TTG GCA GAT CAA GGT CAA 3329 
Ser Ser Asn Leu Lys Ala Thr Pro Ala Val Leu Ala Asp Gin Gly Gin 
115 • 120 125 

CCA GAG TAT CAC ACT CAC TGC GAA GGC AGG GCT TAT TTG CCA CAT AGG 3 377 

Pro Glu Tyr His Thr His Cys Glu Glv Arg Ala Tyr Leu Pro His Arg 
130 135 140 

ATG GGG AAG ACC CCT CCC ATG CTC AAT GTA CCA GAG CAC TTC AGA AGA 3425 
Met Gly Lys Thr Pro Pro Met Leu Asn Val Pro Glu His Phe Arg Arg 
145 150 155 160 

CCA TTC AAT ATA GGT CTT TAC AAG GGA ACG ATT GAG CTC ACA ATG ACC 34 73 

Pro Phe Asn He Gly Leu Tyr. Lys Gly Thr He Glu Leu Thr Met Thr 

165 170 175 

ATC TAC GAT GAT GAG TCA CTG GAA GCA GCT CCT ATG ATC TGG GAT CAT 3 521 

He Tyr Asp Asp Glu Ser Leu Glu Ala Ala Pro Met He Trp Asp His 

180 185 190 

TTC AAT TCT TCC AAA TTT TCT GAT TTC AGA GAG AAG GCC TTA ATG TTT 3 56 9 

Phe Asn Ser Ser Lys Phe Ser Asp Phe Arg Glu Lys Ala Leu Met Phe 
195 200 205 

GGC CTG ATT GTC GAG AAA AAG GCA TCT GGA GCG TGG GTC CTG GAT TCT 3617 
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Gly Leu He Val Glu Lys Lys Ala Ser Gly. Ala Trp Val Leu Asp Ser 
210 215 220 

ATC AGC CAC TTC AAA TGA GCTAGTCTAA CTTCTAGCTT CTGAACAATC 3665 

He Ser His Phe Lys 

225 

CCCGGTTTAC TCAGTCTCTC CTAATTCCAG CCTCTCGAAC AACTAATATC CTGTCTTTTC 3725 

TATCCCTATG AAAAAAACTA ACAGAGATCG ATCTGTTTAC GCGTCACT ATG AAG TGC 3782 

Met Lys Cys 
1 

CTT TTG TAC TTA GCC TTT TTA TTC ATT GGG GTG AAT TGC AAG TTC ACC 3830 
Leu Leu Tyr Leu Ala Phe Leu Phe He Gly Val Asn Cys Lys Phe Thr 
5 10 15 

ATA GTT TTT CCA CAC AAC CAA AAA GGA AAC TGG AAA AAT GTT CCT TCT 38 78 

He Val Phe Pro His Asn Gin Lys Gly Asn Trp Lys Asn Val Pro Ser 
20 25 . 30 35 

AAT TAC CAT TAT TGC CCG TCA AGC TCA GAT TTA AAT TGG CAT AAT GAC 3926 
Asn Tyr His Tyr Cys Pro Ser Ser Ser Asp Leu Asn Trp His Asn Asp 

40 45 ~ 50 

TTA ATA GGC ACA GCC ATA CAA GTC AAA ATG CCC AAG AGT CAC AAG GCT 3974 
Leu lie Gly Thr Ala lie Gin Val Lys Met Pro Lys Ser His Lys Ala 

55 60 65 

ATT CAA GCA GAC GGT TGG ATG TGT CAT GCT TCC AAA TGG GTC ACT ACT 4022 
lie Gin Ala Asp Gly Trp Met Cys His Ala Ser Lys Trp Val Thr Thr 
70 75 80 

TGT GAT TTC CGC TGG TAT GGA CCG AAG TAT ATA ACA CAG TCC ATC PGA . /p — 

Cys Asp Phe Arg Trp Tyr Gly Pro Lys Tyr lie Thr Gin Ser He Ara 
85 90 95 3 

TCC TTC ACT CCA TCT GTA GAA CAA TGC AAG GAA AGC ATT GAA CAA ACG 4118 
Ser Phe Thr Pro Ser Val Glu Gin Cys Lys Glu Ser lie Glu Gin Thr 
100 105 no us 

AAA CAA GGA ACT TGG CTG -AAT CCA GGC TTC CCT CCT CAA AGT TGT GGA 
Lys Gin Gly Thr Trp Leu Asn Pro Gly Phe Pro Pro Gin Ser Cys Glv 

120 125 ilo 

TAT GCA ACT GTG ACG " GAT GCC GAA GCA GTG ATT GTC CAG GTG ACT CCT 4214 
Tyr Ala Thr Val Thr Asp Ala Glu Ala Val He" Val Gin Val Thr Pro " 

135 140 145 

CAC CAT GTG CTG GTT GAT GAA TAC ACA GGA GAA TGG GTT GAT TCA CAG 4262 
His His Val Leu Val Asp Glu Tyr Thr Gly Glu Trp Val Asp Ser Gin 
150 155 160 

TTC ATC AAC GGA AAA TGC AGC AAT TAC ATA TGC CCC ACT GTC CAT AAC ' 
Phe He Asn Gly Lys Cys Ser Asn Tyr He Cys Pro Thr Val His Asn 
165 170 175 



4166 



4310 



TCT ACA ACC TGG CAT TCT GAC TAT AAG GTC AAA GGG CTA TGT GAT TCT 43 58 

Ser Thr Thr Trp His Ser Asp Tyr Lys Val Lys Gly Leu Cys Asp Ser 
180 185 190 195 

AAC CTC ATT TCC ATG GAC ATC ACC TTC . TTC TCA GAG GAC GGA GAG CTA 44 06 

Asn Leu He Ser Met Asp lie Thr Phe Phe Ser Glu Asp Gly Glu Leu 

200 205 210 

TCA TCC CTG GGA AAG GAG GGC ACA GGG TTC AGA AGT AAC TAC TTT GC^ 
Ser Ser Leu Gly Lys Glu Gly Thr Gly Phe Arg Ser Asn Tyr Phe Ala 
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215 220 225 

TAT GAA ACT GGA GGC AAG GCC TGC AAA ATG CAA TAC TGC AAG CAT TGG 45 02 

Tyr Glu Thr Gly Gly Lys Ala Cys Lys Met Gin Tyr Cys Lys His Trp 
.230 235 J 240 

GGA GTC AGA CTC CCA TCA GGT . GTC TGG TTC GAG ATG GCT GAT AAG GAT 4 5 5 0 

Gly Val Arg Leu Pro Ser Gly Val Trp Phe Glu Met Ala Asp Lys Asp 
245 250 255 

CTC TTT GCT GCA GCC AGA TTC CCT GAA TGC CCA GAA GGG TCA AGT ATC 4598 
Leu Phe Ala Ala Ala Arg Phe Pro Glu Cys Pro Glu Gly Ser Ser lie 
260 265 270 " 275 

TCT GCT CCA TCT CAG ACC TCA GTG GAT GTA AGT CTA ATT CAG GAC GTT 4646 
Ser Ala Pro Ser Gin Thr Ser Val Asp Val Ser Leu lie Gin Asp Val 

280 285 290 

GAG AGG ATC TTG GAT TAT TCC CTC TGC CAA GAA ACC TGG AGC AAA ATC 4694 
Glu Arg lie Leu Asp Tyr Ser Leu Cys Gin Glu Thr Trp Ser Lys lie 

295 300 305 

AGA GCG GGT CTT CCA ATC TCT CCA GTG GAT CTC AGC TAT CTT GCT CCT 4 742 

Arg Ala Gly Leu Pro lie Ser Pro Val Asp Leu Ser Tyr Leu Ala Pro 
310 315 " 320 

AAA AAC CCA GGA ACC GGT CCT GCT TTC ACC ATA ATC AAT GGT ACC CTA 4790 
Lys Asn Pro Gly Thr Gly Pro Ala Phe Thr He He Asn Gly Thr Leu 
325 330 335 

AAA TAC TTT GAG ACC AGA TAC ATC AGA GTC GAT ATT GCT GCT CCA ATC 4 838 

Lys Tyr Phe Glu Thr Arg Tyr He Arg Val Asp He Ala Ala Pro He 
340 345 350 355 

CTC TCA AGA ATG GTC GGA ATG ATC AGT GGA ACT ACC ACA GAA AGG GAA 4 886 

Leu Ser Arg Met Val Gly Met He Ser Gly Thr Thr Thr Glu Arg Glu 

360 365 370 

CTG TGG GAT GAC TGG GCA CCA TAT GAA GAC GTG GAA ATT GGA CCC AAT 4934 
Leu Trp Asp Asp Trp Ala Pro Tyr Glu Asp Val Glu He Gly Pro Asn 

375 380 * 385 

GGA GTT CTG AGG ACC AGT TCA GGA TAT AAG TTT CCT TTA TAC ATG ATT 4 982 

Gly Val Leu Arg Thr Ser Ser Gly Tyr Lys Phe Pro Leu Tyr Met He 
390 395 * 400 

GGA CAT GGT ATG TTG GAC TCC GAT CTT CAT CTT AGC TCA AAG GCT CAG 503 0 

Gly His Gly Met Leu Asp Ser Asp Leu His Leu Ser Ser Lys Ala Gin 
405 410 415 

GTG TTC GAA CAT CCT CAC ATT CAA GAC GCT GCT TCG CAA CTT CCT GAT 5078 
Val Phe Glu His Pro His He Gin Asp Ala Ala Ser Gin Leu Pro Asp 
420 425 430 435 

GAT GAG AGT TTA TTT TTT GGT GAT ACT GGG CTA TCC AAA AAT CCA ATC 5126 
Asp Glu Ser Leu Phe Phe Gly Asp Thr Gly Leu Ser Lys Asn Pro He 

440 445 ~ 450 

GAG CTT GTA GAA GGT TGG TTC AGT AGT TGG AAA AGC TCT ATT GCC TCT 5174 
Glu Leu Val Glu Gly Trp Phe Ser Ser Trp Lys Ser Ser lie Ala Ser 

455 • 460 465 

TTT TTC TTT ATC ATA GGG TTA ATC ATT GGA CTA TTC TTG GTT CTC CGA 5222 
Phe Phe Phe He He Gly Leu lie lie Gly Leu Phe Leu Val Leu Arg 
470 475 480 

GTT GGT ATC CAT CTT TGC ATT AAA TTA AAG CAC ACC AAG AAA AGA CAG 527 0 
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Val Gly He His Leu Cys He Lys Leu Lys His Thr Lys Lys Arg Gin 
485 4 90 495 " 

ATT TAT ACA GAC ATA GAG ATG AAC CGA CTT GGA AAG TAA CTCAAATCCT 5319 
He Tyr Thr Asp He Glu Met Asn Arg Leu Gly Lys 
500 505 ~ 510 

GCTAGCCAGA TTCTTCATGT TTGGACCAAA TCAACTTGTG ATACCATGCT CAAAGAGGCC 53 79 

TCAATTATAT TTGAGTTTTT AATTTTTATG AAAAAAACTA ACAGCAATC ATG GAA 54 34 

Met Glu 
1 

GTC CAC GAT TTT GAG ACC GAC GAG TTC AAT GAT TTC AAT GAA GAT GAC 54 82 

Val His Asp Phe Glu Thr Asp Glu Phe Asn Asp Phe Asn Glu Asp Asp 
5 10 15 

TAT GCC ACA AGA GAA TTC CTG AAT CCC GAT GAG CGC ATG ACG TAC TTG 5530 
Tyr Ala Thr Arg Glu Phe Leu Asn Pro Asp Glu Arg Met Thr Tyr Leu 
20 25 "30 

AAT CAT GCT GAT TAC AAT TTG AAT TCT CCT CTA ATT AGT GAT GAT ATT 5578 
Asn His Ala Asp Tyr Asn Leu Asn Ser Pro Leu He Ser Asp Asp lie 
35 40 45 . 50 

GAC AAT TTG ATC AGG AAA TTC AAT TCT CTT CCG ATT CCC TCG ATG TGG 
Asp Asn Leu He Arg Lys Phe Asn Ser Leu Pro He Pro Ser Met Trp 

5 5 60 65 

GAT AGT AAG AAC TGG GAT GGA GTT CTT GAG ATG TTA ACA TCA TGT CAA 5674 
Asp Ser Lys Asn Trp Asp Gly Val Leu Glu Met Leu Thf Ser Cys Gin 

70 75 80 

GCC AAT CCC ATC TCA ACA TCT CAG ATG CAT AAA TGG ATG GGA AGT TGG' *"2~ 
Axa Asn Pro He Ser Thr Ser Gin Met His Lys Trp Met Gly Ser Trp " 
85 90 " 95 

TTA ATG TCT GAT AAT CAT GAT GCC AGT CAA GGG TAT AGT TTT TTA CAT 
Leu Met Ser Asp Asn His Asp Ala Ser Gin Gly Tyr Ser Phe Leu His 
100 105 no 

GAA GTG GAC AAA GAG GCA GAA ATA ACA TTT GAC GTG GTG GAG ACC TTC 5818 
Glu Val Asp Lys Glu Ala Glu He Thr Phe Asp Val Val Glu Thr Phe 
115 120 125 130 

ATC CGC GGC TGG GGC AAC AAA CCA ATT GAA TAC ATC AAA AAG GAA AGA 5866 
He Arg Gly Trp Gly Asn Lys Pro He Glu Tyr He Lys Lys Glu Arg 

135 140 145 

TGG ACT GAC TCA TTC AAA ATT CTC GCT TAT TTG TGT CAA AAG TTT TTG 5914 
Trp Thr Asp Ser Phe Lys lie Leu Ala Tyr Leu Cys Gin Lys Phe Leu 

150 155 i 6 o 

GAC TTA CAC AAG TTG ACA TTA ATC TTA AAT GCT GTC TCT GAG GTG GAA 
Asp Leu His Lys Leu Thr Leu He Leu Asn Ala Val Ser Glu Val Glu 
. 165 170 175 

TTG CTC AAC TTG GCG AGG ACT TTC AAA GGC AAA GTC AGA AGA AGT TCT 6 010 

Leu Leu Asn Leu Ala Arg Thr Phe Lys Gly Lys Val Arg Arg Ser Ser 
180 185 - 190 

CAT GGA ACG AAC ATA TGC AGG ATT AGG GTT CCC AGC TTG GGT CCT ACT 6 05 8 

His Gly Thr Asn lie Cys Arg lie Arg Val Pro Ser Leu Gly Pro Thr 
195 200 205 " 210 

TTT ATT TCA GAA GGA TGG GCT TAC TTC AAG AAA CTT GAT ATT CTA ATG 6106 
Phe lie Ser Glu Gly Trp Ala Tyr Phe Lys Lys Leu Asp lie Leu Met 
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215 220 225 

GAC CGA AAC TTT CTG TTA ATG GTC AAA GAT GTG ATT ATA GGG AGG ATG 6154 
Asp Arg Asn Phe Leu Leu Met Val Lys Asp Val lie lie Gly Arg Met 

23 0 23 5 24 0 

CAA ACG GTG CTA TCC ATG GTA TGT AGA ATA GAC AAC CTG TTC TCA GAG 6202 
Gin Thr Val Leu Ser Met Val Cys Arg lie Asp Asn Leu Phe Ser Glu 
245 250 255 

CAA GAC ATC TTC TCC CTT CTA AAT ATC TAC AGA ATT GGA GAT AAA ATT 6250 
Gin Asp lie Phe Ser Leu Leu Asn lie Tyr Arg lie Gly Asp Lys lie 
260 265 270 

GTG GAG AGG CAG GGA AAT TTT TCT TAT GAC TTG ATT AAA ATG GTG GAA 6298 
Val Glu Arg Gin Gly Asn Phe Ser Tyr Asp Leu lie Lys Met Val Glu 
275 280 285 290 

CCG ATA TGC AAC TTG AAG CTG* ATG AAA TTA GCA AGA GAA TCA AGG CCT 6346 
Pro lie Cys Asn Leu Lys Leu Met Lys Leu Ala Arg Glu Ser Arg Pro 

295 300 305 

TTA GTC CCA CAA TTC CCT CAT TTT GAA AAT CAT ATC AAG ACT TCT GTT 6394 
Leu Val Pro Gin Phe Pro His Phe Glu Asn His lie Lys Thr Ser Val 

310 315 " 320 

GAT GAA GGG GCA AAA ATT GAC CGA GGT ATA AGA TTC CTC CAT GAT CAG 6442 
Asp Glu Gly Ala Lys lie Asp Arg Gly lie Arg Phe Leu His Asp Gin 
325 330 335 

ATA ATG AGT GTG AAA ACA GTG GAT CTC ACA CTG GTG ATT TAT GGA TCG 64 90 

lie Met Ser Val Lys Thr Val Asp Leu Thr Leu Val lie Tyr Gly Ser 
340 345 " 350 

TTC AGA CAT TGG GGT CAT CCT TTT ATA GAT TAT TAC ACT GGA CTA GAA 653 8 

Phe Arg His Trp Gly His Pro Phe lie Asp Tyr Tyr Thr Gly Leu Glu 
355 360 365 370 * 

AAA TTA CAT TCC CAA GTA ACC ATG AAG AAA GAT ATT GAT GTG TCA TAT 6586 
Lys Leu His Ser Gin Val Thr Met Lys Lys Asp lie Asp Val Ser Tyr 

375 380 " 385 

GCA AAA GCA CTT GCA AGT GAT TTA GCT CGG ATT GTT CTA TTT CAA CAG 6634 
Ala Lys Ala Leu Ala Ser Asp Leu Ala Arg lie Val Leu Phe Gin Gin 

390 395 400 

TTC AAT GAT CAT AAA AAG TGG TTC GTG AAT GGA GAC TTG CTC CCT CAT 66 82 

Phe Asn Asp His Lys Lys Trp Phe Val Asn Gly Asp Leu Leu Pro His 
405 410 ~ 415 

GAT CAT CCC TTT AAA AGT CAT GTT AAA GAA AAT ACA TGG CCC ACA GCT 673 0 

Asp His Pro Phe Lys Ser His Val Lys Glu Asn Thr Trp Pro Thr Ala 
420 425 430 

GCT CAA GTT CAA GAT TTT GGA GAT AAA TGG CAT GAA CTT CCG CTG ATT 677 8 

Ala Gin Val Gin Asp Phe Gly Asp Lys Trp His Glu Leu Pro Leu lie 
435 440 445 450 

AAA TGT TTT GAA ATA CCC GAC TTA CTA GAC CCA TCG ATA ATA TAC TCT 6 826 

Lys Cys Phe Glu lie Pro Asp Leu Leu Asp Pro Ser lie lie Tyr Ser 

455 460 465 

GAC AAA AGT CAT TCA ATG AAT AGG TCA GAG GTG TTG AAA CAT GTC CGA 6 874 

Asp Lys Ser His Ser Met Asn Arg Ser Glu Val Leu Lys His Val Arg 

4 70 4 75 480 

ATG AAT CCG AAC ACT CCT ATC CCT AGT AAA AAG GTG TTG CAG ACT ATG 6922 
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Met Asn Pro Asn Thr Pro He Pro Ser Lys Lys Val Leu Gin Thr Met 
4 85 490 495 

TTG GAC ACA AAG GCT ACC AAT TGG AAA GAA TTT CTT AAA GAG ATT GAT 6 970 

Leu Asp Thr Lys Ala Thr Asn Trp Lys Glu Phe Leu Lys Glu He Asp 
500 505 510 

GAG AAG GGC TTA GAT GAT GAT GAT CTA ATT ATT GGT CTT AAA GGA AAG 7018 
Glu Lys Gly Leu Asp Asp Asp Asp Leu He He Gly Leu Lys Gly Lys 
515 520 525 * * 530 

GAG AGG GAA CTG AAG TTG GCA GGT AGA TTT TTC TCC CTA ATG TCT TGG 7066 
Glu Arg Glu Leu Lys Leu Ala Gly Arg Phe Phe Ser Leu Met Ser Trp 

535 540 545 

AAA TTG CGA GAA TAC TTT GTA ATT ACC GAA TAT TTG ATA AAG ACT CAT 7114 
Lys Leu Arg Glu Tyr Phe Val He Thr Glu Tyr Leu He Lys Thr His 

550 555 560 

TTC GTC CCT ATG TTT AAA GGC CTG ACA ATG GCG GAC GAT CTA ACT GCA 7162 
Phe Val Pro Met Phe Lys Gly Leu Thr Met Ala Asp Asp Leu Thr Ala 
555 570 575 

GTC ATT AAA AAG ATG TTA GAT TCC TCA TCC GGC CAA GGA TTG AAG TCA 7210 
•Val He Lys Lys Met Leu Asp Ser Ser Ser Gly Gin Gly Leu Lys Ser 
580 585 590 

TAT GAG GCA ATT TGC ATA GCC AAT CAC ATT GAT TAC GAA AAA TGG AAT 7258 
Tyr Glu Ala He Cys He Ala Asn His He Asp Tyr Glu Lys Trp Asn 
595 500 605 " 610 

AAC CAC CAA AGG AAG TTA TCA AAC GGC CCA GTG TTC CGA GTT ATG GGC ' 7306 
Asn .His Gin Arg Lys Leu Ser Asn Gly Pro Val Phe Arg Val Met Gly ■ 

- '•- 615 620 625 

CAG TTC TTA GGT TAT CCA TCC TTA ATC GAG AGA ACT CAT GAA TTT TTT 7354 
Gin Phe Leu Gly Tyr Pro Ser Leu He Glu Arg Thr His Glu Phe Phe 

630 635 640 

GAG AAA AGT CTT ATA TAC TAC AAT GGA AGA CCA GAC TTG ATG CGT GTT 7402 
Glu Lys Ser Leu He Tyr Tyr Asn Gly Arg Pro Asp Leu Met Arg Val 
64 5 650 655 



7450 



CAC AAC AAC ACA CTG ATC AAT TCA ACC TCC CAA CGA GTT TGT TGG CAA 
His Asn Asn Thr Leu He Asn Ser Thr Ser Gin Arg Val Cys Trp Gin 
660 665 670 

GGA CAA GAG GGT GGA CTG GAA GGT CTA CGG CAA AAA GGA TGG ACT ATC 7498 
Gly Gin Glu Gly Gly Leu Glu Gly Leu Arg Gin Lys Gly Trp Thr He 
675 680 685. 690 

CTC AAT CTA CTG GTT ATT CAA AGA GAG GCT AAA ATC AGA AAC ACT GCT 754 6 

Leu Asn Leu Leu Val He Gin Arg Glu Ala Lys He Arg Asn Thr Ala 

695 700 70S 

GTC AAA GTC TTG GCA CAA GGT GAT AAT CAA GTT ATT TGC ACA CAG TAT 75 94 

Val Lys Val Leu Ala Gin Gly Asp Asn Gin Val He Cys Thr Gin Tyr 

71 ° 715 720 

AAA ACG AAG AAA TCG AGA AAC GTT GTA GAA TTA CAG GGT GCT CTC AAT 
Lys Thr Lys Lys Ser Arg Asn Val Val Glu Leu Gin Gly Ala Leu Asn 
725 730 735 

CAA ATG GTT TCT AAT AAT GAG AAA ATT ATG ACT GCA ATC AAA ATA GGG 76 90 

Gin- Met Val Ser Asn Asn Glu Lys He Met Thr Ala lie Lys He Gly 
740 745 750 Y 
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ACA GGG AAG TTA GGA CTT TTG ATA AAT GAC GAT GAG ACT ATG CAA TCT 773 8 

Thr Gly Lys Leu Gly Leu Leu He Asn Asp Asp Glu Thr Met Gin Ser 
755 760 765 770 

GCA GAT TAC TTG AAT TAT GGA AAA ATA CCG ATT TTC CGT GGA GTG ATT 7 786 

Ala Asp Tyr Leu Asn Tyr Gly Lys He Pro He Phe Arg Gly Val He 

775 780 ~ 785 

AGA GGG TTA GAG ACC AAG AGA TGG TCA CGA GTG ACT TGT GTC ACC AAT 7834 
Arg Gly Leu Glu Thr Lys Arg Trp Ser Arg Val Thr Cys Val Thr Asn 

790 795 800 

GAC CAA ATA CCC ACT TGT GCT AAT ATA ATG AGC TCA GTT TCC ACA AAT 78 82 

Asp Gin He Pro Thr Cys Ala Asn He Met Ser Ser Val Ser Thr Asn 
805 810 815 

GCT CTC ACC GTA GCT CAT TTT GCT GAG AAC CCA ATC AAT GCC ATG ATA 7 93 0 

Ala Leu Thr Val Ala His Phe Ala Glu Asn Pro lie Asn Ala Met He 
820 825 830 

CAG TAC AAT TAT TTT GGG ACA TTT GCT AGA CTC TTG TTG ATG ATG CAT 7978 
Gin Tyr Asn Tyr Phe Gly Thr Phe Ala Arg Leu Leu Leu Met Met His 
835 840 845 850 

GAT CCT GCT CTT CGT CAA TCA TTG TAT GAA GTT CAA GAT AAG ATA CCG ' 8026 
Asp Pro Ala Leu Arg Gin Ser Leu Tyr Glu Val Gin Asp Lys lie Pro 

855 860 865 

GGC TTG CAC AGT TCT ACT TTC AAA ' TAC GCC ATG TTG TAT TTG GAC CCT 8 074 

Gly Leu His Ser Ser Thr Phe Lys Tyr Ala Met Leu Tyr Leu Asp Pro 

870 875 ' 880 

TCC ATT GGA GGA GTG TCG GGC ATG TCT TTG TCC AGG TTT TTG ATT AGA 8122 
Ser lie Gly Gly V-:.l Ser Gly Met Se - Leu Vier Ar tve Leu He Arg . 
885 890 ' 895 

GCC TTC CCA GAT CCC GTA ACA GAA AGT CTC TCA TTC TGG AGA TTC ATC 8170 
Ala Phe Pro Asp Pro Val Thr Glu Ser Leu Ser Phe Trp Arg Phe lie 
900 905 sio 

CAT GTA CAT GCT CGA AGT GAG CAT CTG AAG GAG ATG AGT GCA GTA TTT B218 
His Val His Ala Arg Ser Glu His Leu Lys Glu Met Ser Ala Val Phe 
915 920 925 930 

GGA AAC CCC GAG ATA GCC AAG TTT CGA ATA ACT CAC ATA GAC AAG CTA 8266 
Gly Asn Pro Glu lie Ala Lys Phe Arg lie Thr His He Asp Lys Leu 

935 940 945 

GTA GAA GAT CCA ACC TCT CTG AAC ATC GCT ATG GGA ATG AGT CCA GCG 8314 
Val Glu Asp Pro Thr Ser Leu Asn lie Ala Met Gly Met Ser Pro Ala 

950 955 960 

AAC TTG TTA AAG ACT GAG GTT AAA AAA TGC TTA ATC GAA TCA AGA CAA 8362 
Asn Leu Leu Lys Thr Glu Val Lys Lys Cys Leu lie Glu Ser Arg Gin 
965 970 975 

ACC ATC AGG AAC CAG GTG ATT AAG GAT GCA ACC ATA TAT TTG TAT CAT 84^0 
Thr lie Arg Asn Gin Val lie Lys Asp Ala Thr lie Tyr Leu Tyr His 
980 985 990 

GAA GAG GAT CGG CTC AGA AGT TTC TTA TGG TCA ATA AAT CCT CTG TTC 84 58 

Glu Glu Asp Arg Leu Arg Ser Phe Leu Trp Ser lie Asn Pro Leu Phe 
555 1000 1005 ioi0 

CCT AGA TTT TTA AGT GAA TTC AAA TCA GGC ACT TTT TTG GGA GTC GCA 8506 
Pro Arg Phe Leu Ser Glu Phe Lys Ser Gly Thr Phe Leu Gly Val A"> a 

1015 1020 1025 " 
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GAC GGG CTC ATC AGT CTA TTT CAA AAT TCT CGT ACT ATT CGG AAC TCC 8554 
Asp Gly Leu He Ser Leu Phe Gin Asn Ser Arg Thr He Arg Asn Ser 

103 0 1035 1040 

TTT AAG AAA AAG TAT CAT AGG GAA TTG GAT GAT TTG ATT GTG AGG AGT 86 02 

Phe Lys Lys Lys Tyr His Arg Glu Leu Asp Asp Leu He Val Arg Ser 
1045 1050 1055 

GAG GTA TCC TCT TTG ACA CAT TTA GGG AAA CTT CAT TTG AGA AGG GGA 
Glu Val Ser Ser Leu Thr His Leu Gly Lys 'Leu His Leu Arg Arg Gly 
1060 1065 1070 

TCA TGT AAA ATG TGG ACA TGT TCA GCT ACT CAT GCT GAC ACA TTA AGA 86 98 

Ser Cys Lys Met Trp Thr Cys Ser Ala Thr His Ala Asn Thr Leu Arg 
1015 1080 1085 " 1090 

TAC AAA TCC TGG GGC CGT ACA GTT ATT GGG ACA ACT GTA CCC CAT CCA 874* 
Tyr Lys Ser Trp Gly Arg Thr Val He Gly Thr Thr Val Pro His Pro 

1095 iioo ii 0 5 

TTA GAA ATG TTG GGT CCA CAA CAT CGA AAA GAG ACT CCT TGT GCA CCA B794 
Leu Glu Met Leu Gly Pro Gin His Arg Lys Glu Thr Pro Cys Ala Pro 

H10 ins 1120 

TGT AAC ACA TCA GGG TTC AAT TAT GTT TCT GTG CAT TGT CCA GAC GGG 
Cys Asn Thr Ser Gly Phe Asn Tyr Val Ser Val His Cys Pro Asp Gly 
H25 H30 1135 

ATC CAT GAC GTC TTT AGT TCA CGG GGA CCA TTG CCT GCT TAT CTA GGG 

?J"!„ ASP Phe Ser Ser Arg Gl y Pro Leu Pro Ala Tyr Leu Gly 

1140 H45 1150 

TCT AAA ACA TCT GAA TCT ACA TCT ATT TTG CAG CCT TGG GAA AGG GAA 
S^r Lys Thr Ser Glu Ser Thr. Ser He Leu Gin Pro Trp Glu Arg Glu 
1155 " 1160 1165 " nvo 

AGC AAA GTC CCA CTG ATT AAA AGA GCT ACA CGT CTT AGA GAT GCT ATC 
Ser Lys Val Pro Leu lie Lys Arg Ala Thr Arg Leu Arg Asp Ala He 

H75 HBO " ii 8 5 

Ser ES ll T JTT 2** S CC GAC TCT *** CTA GCA ATG ACT ATA ^T TCT 
Ser Trp Phe Val Glu Pro Asp Ser Lys Leu Ala Met Thr He Leu Ser 

1190 H95 • 1200 

AAC ATC CAC TCT TTA ACA GGC GAA GAA TGG ACC AAA AGG CAG CAT GGG 
Asn He His Ser Leu Thr Gly Glu Glu Trp Thr Lys Arg Gin His Gly 
1205 1210 1215 

TTC AAA AGA ACA GGG TCT GCC CTT CAT AGG TTT TCG ACA TCT CGG ATG 
Phe ^J** Thr G1 y Ser A *a ^u His Arg Phe Ser Thr Ser Arg Met 
1220 1225 1230 

AGC CAT GGT GGG TTC GCA TCT CAG AGC ACT GCA GCA TTG ACC AGG TTG 
Ser His Gly Gly Phe Ala Ser Gin Ser Thr Ala Ala Leu Thr Arg Leu 
1235 1240 1245 5 1250 

ATG GCA ACT ACA GAC ACC ATG AGG GAT CTG GGA GAT CAG AAT TTC PAP 
Met Ala Thr Thr Asp Thr Met Arg Asp Leu Gly Asp Gin A^n Phe S£ 

1255 1260 1265 

TTT TTA TTC CAA GCA ACG TTG CTC TAT GCT CAA ATT ACC ACC ACT GTT 
Phe Leu Phe Gin Ala Thr Leu Leu Tyr Ala Gin He Thr Thr Thr Val 

1270 1275 1280 

GCA AGA GAC GGA TGG ' ATC ACC AGT TGT ACA GAT CAT TAT CAT ATT GCC 
Ala Arg Asp Gly Trp He Thr Ser Cys Thr Asp His Tyr His He Ala 
1285 1290 1295 
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TGT AAG TCC TGT TTG AGA prr in ™* 

«. g. s „ cy, „ eu 5 g s sss ss ill s a 25 S XS 

1305 1310 
ATG GAC TAC ACG CCC CCA GAT GTA TCC CAT arc CTC aar fc « 

^ ^ Thr Pro So^ val s * r .™ 2 £ S 



1320 

1325 1330 



S2 SS Si S S3 IS a SS S Si SK £ £ S2 
IX SS S S K i£ Si S S Eg S£ s 5 as 

1355 1360 

ss s s a is e a =s ss a s s s s s 



1370 i 375 



S 2 S SS 2J £2 s £ £ S S IS g g 

1385 1390 

S S 15 S 5 S2 5 2? 2 SS 5 2 S S 

1405 141Q 

GCA AGT TGC TGC CAA GTA ATA rar rrr *«m „ 

Ala Ser Cys Cys ^JR £ His g £ 2 S ffi S £2 

1415 1420 



1425 



« s a ss ™ ss s^ffs: £ £ $ 

1435 1440 
TTG AGT GTA TCA CCT CCA TTC ptt t^t *» m 

l.« «, v.j s s« *. S s 25 £ £ is; g_5 5 » 



1455 



AGA GAC GAA TTA GAA ACG ATT rrr nar ~~ 

Ar, jjp au Leu Glu £ ££C His £ S 5£ 25 £ S 

1465 14?0 



ACA AGC AAC CGT GAT ATG GGG GTG ATT GTC AGA AAT TAP ttp **a ™„ 
TtaSer Asn Arg Asp Met Gly Val lie Val Arg £n lit $ g£ 



1480 148 r . 

1490 



Si ss ss 22 s as ss SS £ £ s » - » s « 



1495 ■ "oo 150S 



9370 



9418 



9466 



9514 



9562 



9610 



9658 



9706 



9754 



9802 



9850 



9898 



9946 



9994 



10042 



a s s ss ss si a e s ss s s ss ss s 

1515 1520 

22 lie Ser 5£ S 25 £S ^ ^ CCA TTT ™ TCT 

1525 ?«n lle eU Tyr Lys Pro phe ^ Ser 

1530 1535 

GGG AAA GAT AAG AAT GAG TTG AGA GAG CTG GCA AAT CTT TCT Tra ™n 

Gly Ly^Asp Lys Asn Glu Leu Arg Glu Leu Ala A^n S Se'r Se'r 22 10090 

1545 1550 
CTA AGA TCA GGA GAG GGG TGG GAA GAC AT* r&T r-rr 

Leu Arg Ser Gly Glu Gly Trp Glu Asp lie Sis vJ S° £ C ACC 10138 

1555 1560 yS Phe Phe Thr 

1565 1570 
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AAG GAC ATA TTA TTG TGT CCA GAG GAA ATC AGA CAT GCT TGC AAG TTC 
Lys Asp He Leu Leu Cys Pro Glu Glu He Arg His Ala Cys Lys Phe 

1575 1580 



1585 



GGG ATT GCT AAG GAT AAT AAT AAA GAC ATG AGC TAT CCC CCT TGG GGA 
Gly lie Ala Lys Asp Asn Asn Lys Asp Met Ser Tyr Pro Pro Trp Glv 

1590 1595 1600 

AGG GAA TCC AGA GGG ACA ATT ACA ACA ATC CCT GTT TAT TAT ACG ACC 
Arg Glu Ser Arg Gly Thr lie Thr Thr lie Pro Val Tyr Tyr Thr Thr 
1605 i6io 1615 

ACC CCT TAC CCA AAG ATG CTA GAG ATG CCT CCA AGA ATC CAA AAT CCC 
Thr Pro Tyr Pro Lys Met Leu Glu Met Pro Pro Arg lie Gin Asn Pro 
1620 1625' 1630 

CTG CTG TCC GGA ATC AGG TTG GGC CAA TTA CCA ACT GGC GCT CAT TAT 
Leu Leu Ser Gly lie Arg Leu Gly Gin Leu Pro Thr Gly Ala His Tyr 

1635 1640 1645 il; 0 

AAA ATT CGG AGT ATA TTA CAT GGA ATG GGA ATC CAT TAC AGG GAC TTC 
Lys lie Arg Ser lie Leu His Gly Met Gly lie His Tyr Arg Asp Phe 

1655 1660 " 1665 

TTG AGT TGT GGA GAC GGC TCC GGA GGG ATG ACT GCT GCA TTA CTA CGA 
Leu Ser Cys Gly Asp Gly Ser Gly Gly Met Thr Ala Ala Leu Leu Arg 
• 1670 1675 168 0 



T-T T , ^ Aw* UGA ATA TTC AAT AGT CTG TTA GAA TTA TCA 

Glu Asn Val His Ser Arg Gly lie Phe Asn Ser Leu Leu Glu Leu Ser 
1685 1690 

™» xw* GTC -ATG CGA GGC GCC TCT CCT GAG CCC CCC AGT GCC CTA GAA 
7 1700 Met " Ar9 ^05 SCr ^ ?J?. 8 " 



^ ™ ^ ^ ™ A 1 TC AGT CTG TTA GAA TTA TCA 

Leu ( 
1695 

i?v S3? 2£ ?? C I" S CT GAG CC ^ CCC AGT GCC CTA GAA 

Pro : 

1710 

Thr ? M ?** TCG AGA TGT GTA ** T GGT ^ ACA TGT TGG 

Thr^Leu Gly Gly Asp Lys Ser Arg Cys Val Asn Gly Glu Thr Cys Trp 

1725 1730 

GAA TAT CCA TCT GAC TTA TGT GAC CCA AGG ACT TGG GAC TAT TTC CTc' 
Glu Tyr Pro Ser Asp Leu Cys Asp Pro Arg Thr Trp Asp T^r Phe ££ 

1735 1740 1745 

CGA CTC AAA GCA GGC TTG GGG CTT CAA ATT GAT TTA ATT GTA ATG GAT 
Arg. Leu Lys Ala Gly Leu Gly Leu Gin He Asp Leu lie Val Met Isl 

1750 175S 176Q 

S vJT ? M c CT ? CT ACT AGC CTG ATT ^G ACG AAT GTT 

Glu ' 

1775 

^ ™. G !? £ AC CGG *™ F TG GAT ? AG CAA GGA GTT TTA ATC TAC 

1790 

AAG ACT TAT GGA ACA TAT ATT TGT GAG AGC GAA AAG AAT GCA GTA ACA 
Lys Thr Tyr Gly Thr Tyr lie Cys Glu Ser Glu Lys Asn ilia vll Thr 
9 - 1800 1805 leiQ 

ATC CTT GGT CCC ATG TTC AAG ACG GTC GAC TTA GTT CAA ACA GAA TTT 
He Leu Gly Pro Met Phe Lys Thr Val Asp Leu Val Gin Thr Glu pie 

1815 1820 1625 

AGT AGT TCT CAA ACG TCT GAA GTA TAT ATG GTA TGT AAA GGT TTG AAG 
Ser Ser Ser Gin Thr Ser Glu Val Tyr Met Val Cys Lys Glv Leu Lys 

1830 1835 1840 
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Met cT, v-l iZZ » t t u AGC CTG *** ATT GAG ACG AAT GTT 
Met Glu Val Arg Asp Ser Ser Thr Ser Leu Lys He Glu Thr Asn Val 
1765 1770 

TAT GTG CAC CGG ATT TTG GAT GAG CAA GGA GTT TTA ATC TAC 
Arg As^Tyr Val His Arg lie u«. Asp Glu Gin Gly_Val leu lie T^r 



10186 



10234 



102B2 



10330 



10378 



10426 



10474 



10522 



10570 



10618 



10666 



10714 



10762 



10810 



10858 



10906 



10954 



BNSDOCID: <WO 



9634625A1_I_> 



WO 96/34625 



PCT/US96/06053 



7I£ tT C Sf* CCC ** T CCC GAT TGG TCT TCC ATC AAT GAA TCC 

Lys Leu lie Asp Glu Pro Asn Pro Asp Trp Ser Ser lie Asn Glu Ser 

1845 1850 1855 

TGG AAA AAC CTG TAC GCA TTC CAG TCA TCA GAA CAG GAA TTT GCC AGA 
ft™ ^ Phe Gln Ser Ser Glu Gln Glu Phe Ala Arg 

1860 1865 1870 

GCA AAG AAG GTT AGT ACA TAC TTT ACC TTG ACA GGT ATT CCC TCC CAA 
Ala Lys Lys Val Ser Thr Tyr Phe Thr Leu Thr Gly lie Pro ier Gln 
. 1875 1880 1885 ■ 1890 

TTC ATT CCT GAT CCT TTT GTA AAC ATT GAG ACT ATG CTA CAA ATA TTC 
Phe Ixe Pro Asp Pro Phe Val Asn lie Glu Thr Met Leu Gln lie Phe 

1895 1900 1905 

GGA GTA CCC ACG GGT GTG TCT .CAT GCG GCT GCC TTA AAA TCA TCT GAT 
Gly Val Pro Thr Gly Val Ser His Ala Ala Ala £eu Lys Ser Ser AsJ 

1910 1915 1920 

AGA CCT GCA GAT TTA TTG ACC ATT AGC CTT TTT TAT ATG GCG ATT ATA 
Arg Pro Ala Asp Leu Leu Thr lie Ser Leu Phe Tyr Met Ala lie fie 
19-5 1930 1935 

Ser Er" £r £n TlS 2* ^ AGA GTA GGA CCG ATA CCT CCG **C 

VZo 7 ?t!. Ile Ar9 Val G1 y Pro Ile Pro *«> Asn 

A: " ilJ 1945 i95o 

Pro P^n cf* f T GGA A T T GCA ^ GTG GGG ATC GCT ATA ACT GGT 

Pr^Pro Ser Asp Gly lie Ala Gln Asn Val Gly lie Ala lie Thr Gly 

1"«0 1965 1970 

lie Ser Phe 22 t™ ^ P° ATG GAG *** GAC ATT CCA CTA TAT GA * 
Ile Ser Phe Trp Leu Ser Leu Met Glu Lys As lie ic- - G3n 

1975 I960 19e5 . 

G^n lyl Uu ST T?° ^ TCA TTC CCG ATT AGG TGG GAG GCT. 

Gln Cys Leu Ala Val Ile Gln Gln Ser Phe Pro Ile Arg Trp Glu Ala 

1990 1995 2 ooo 

Val sfr vlt ffi r? A TA ° ^ CAG ^ G TGG AGT ACT AGA GGT GAT 

Val Ser Val Lys Gly Gly Tyr Lys Gln Lys Trp Ser Thr Arg Gly Asp 

2005 2010 2015 

GIv Leu Prn ^ ^ *5 T TCA GAG TCC TTG GCC CCA ATC GGG 

Gly 7 ^ P Thr ^ g Thr Ser As P Ser Leu Ala Pro lie Gly 

2020 .2025 2030 

AAC TGG ATC AGA TCT CTG GAA TTG GTC CGA AAC CAA GTT CGT CTA AAT 
Asn Trp lie Arg Ser Leu Glu Leu Val Arg Asn Gln Val £g Leu* 
035 2040 2045 2050 

Pro Pho £ T r? C ™ TTC ^ CAG CTA TGT CGT ACA GTG GAT AAT 

Pro Phe Asn Glu lie Leu Phe Asn Gln Leu Cys Arg Thr Val Asp Asn 

2055 2060 2065 

CAT TTG AAA TGG TCA AAT TTG CGA AGA AAC ACA GGA ATG ATT GAA TGG 
His Leu Lys Trp Ser Asn Leu Arg Arg Asn Thr Gly Set lie ?™ 

2070 2075 2080' 

Til ^ £ GA P A ^ T TCA " GAA GAC CGG TCT ATA CTG ATG TTG AAG 

Ile Asn A ^ Ar 9 Ile L ^ Glu Asp Arg Ser lie Leu Met Leu Lys 

2085 2090. 2095 

AGT GAC CTA CAC GAG GAA AAC TCT TGG AGA GAT TAA AAAATCATrt* 
Ser Asp Leu His Glu Glu Asn Ser Trp Arq Asp AAAATCATGA 
2100 2105 " 



11002 



11050 



11098 



11146 



11194 



11242 



11290 



11338 



11386 



11434 



11482 



11530 



.11578 



11626 



11674 



11722 



11768 



- 86 - 



BNSDOCID: <WO _9634625A1_I_> 



r 



WO 96/34625 



PCT7US96/06053 



GGAGACTCCA AACTTTAAGT ATGAAAAAAA CTTTGATCCT TAAGACCCTC TTGTGGTTTT 
TATTTTTTAT CTGGTTTTGT GGTCTTCGTG GGTCGGCATG GCATCTCCAC CTCCTCGCGG 
TCCGACCTGG GCATCCGAAG GAGGACGTCG TCCACTCGGA TGGCTAAGGG AGGGGCCCCC 
GCGGGGCTGC TAACAAAGCC CGAAAGGAAG CTGAGTTGGC TGCTGCCACC GCTGAGCAAT 
AACTAGCATA ACCCCTTGGG GCCTCTAAAC GGGTCTTGAG GGGTTTTTTG CTGAAAGGAG 
GAACTATATC CGGATCGAGA CCTCGATACT AG TGCGGTGG AGCTCCAGCT TTTGTTCCCT 
TTAGTGAGGG TTAA7TTCGA GCTTGGCGTA ATCATGGTCA TAGCTGTTTC CTGTGTGAAA 
TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA AGCATAAAGT GTAAAGCCTG 
GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG CGCTCACTGC CCGCTTTCCA 
GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG 
TTTGCGTATT GGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA .CAGAATCAGG 
GGATAACGCA GGAAAGAACA TGTGAG CAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA 
GGCCGCGTTG CTGGCGTTTT TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG . 
ACGCTCAAGT CAGAGGTGGC GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC 
TGGAAGCTCC CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAG CTCA CGCTGTAGGT ATCTCAGTTC 
GGTGTAGGTC GTxOGCXCCA AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG 
CTGCGCCTTA TCCGGTAACT ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC 
ACTGGCAGCA GCCACTGGTA ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA 
GTTCTTGAAG TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 
TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC 
CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 
ATCTCAAGAA GATCCTTTGA TCTTTTCTAC GGGGTCTGAC 



ATTACGCGCA GAAAAAAAGG 
GCTCAGTGGA ACGAAAACTC 
ACGTTAAGGG ATTTTGGTCA TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA 
TTAAAAATGA AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 



CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT 
TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG 



CTATTTCGTT CATCCATAGT 
GGCTTACCAT CTGGCCCCAG 



TGCTGCAATG ATACCGCGAG ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA 
GCCAGCCGGA AGGGCCGAGC GGAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC 
TATTAATTGT TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 
TGTTGCCATT GCTACAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG 
CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT 
TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT 
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11828 

11888 

11948 

12008 

12068 

12128 

12188 

12248 

12308 

12368 

12428 

12488 

12548 

12608 

12668 

12728 

12788 

12848 

12908 

12968 

13028 

13088 

13148 

13208 

13268 

13328 

13388 

13448 

13508 

13568 

13628 

13688 

13748 

13808 
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GGTTATGGCA GCACTGCATA ATTCTCTTAC TGTCATGCCA TCCGTAAGAT 
GACTGGTGAG TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC 
TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA 



GCTTTTCTGT 
CGAGTTGCTC 



AAGTGCTCAT 



CATTGGAAAA CGTTCTTCGG GG CG AAAACT 
TTCGATGTAA CCCACTCGTG CACCCAACTG 



CTCAAGGATC TTACCGCTGT TGAGATCCAG 
ATCTTCAGCA TCTTTTACTT TCACCAGCGT 



GGGCGACACG 
ATCAGGG TTA 



TAGGGGTTCC 



TTCTGGGTGA GCAAAAACAG GAAGGCAAAA TGCCGCAAAA AAGGGAATAA 
GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT 
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 
GCGCACATTT CCCCGAAAAG TGC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:. 

Met Ser Val Thr Val Lys Arg lie lie Asp Asn Thr Val lie Val Pro 

Lys Leu Pro Ala Asn Glu Asp Pr:- Val Glu Tyr Pr' z> , r , h ; 

■ 20 25 30- " 

Arg Lys Ser Lys Glu lie Pro Leu Tyr He Asn Thr Thr Lys Ser Leu 

40 45 

Ser Asp Leu Arg Gly Tyr Val Tyr Gin Gly Leu Lys Ser Gly Asn Val 

55 60 

Ser He lie His Val Asn Ser Tyr Leu Tyr Gly Ala Leu Lys Asp lie 

70 75 80 

Arg Gly Lys Leu Asp Lys Asp Trp Ser Ser Phe Gly lie Asn lie Gly 

85 90 95 

Lys Ala Gly Asp Thr He Gly lie Phe Asp Leu Val Ser Leu Lys Ala 

100 105 no 

Leu Asp Gly Val Leu Pro Asp Gly Val Ser Asp Ala Ser Arg Thr Ser 
115 "0 X25 

Ala Asp Asp Lys Trp Leu Pro Leu Tyr Leu Leu Gly Leu Tyr Arg Val 

135 140 

Gly Arg Thr Gin Met Pro Glu Tyr Arg Lys Lys Leu Met Asd Gly Leu 

150 155 * 160 

Thr Asn Gin Cys Lys Met He Asn Glu Gin Phe Glu Pro Leu Val Pro 

165 170 175 

Glu Gly Arg Asp lie Phe Asp Val Trp Gly Asn Asp Ser Asn Tyr Thr 

180 185 iso 

Lys He Val Ala Ala Val Asp Met Phe Phe His Met Phe Lys Lys Kis 



13868 

13928 

13988 

14048 

14108 

14168 

14228 

14288 

14311 
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195 200 205 

Glu Cys Ala Ser Phe Arg Tyr Gly Thr lie Val Ser Arg Phe Lys Asp 
210 215 220 

Cys Ala Ala Leu Ala Thr Phe Gly His Leu Cys Lys lie. Thr Gly Met 
225 230 235 * ~ 240 

Ser Thr Glu Asp Val Thr Thr Trp He Leu Asn Arg Glu Val Ala Asp 

245 x 250 " 255 

Glu Met Val Gin Met Met Leu Pro Gly Gin Glu He Asp Lys Ala Asp 

260 265 270 

Ser Tyr Met Pro Tyr Leu He Asp Phe Gly Leu Ser Ser Lys Ser Pro 
275 280 285 

Tyr Ser Ser Val Lys Asn Pro Ala Phe His Phe Trp Gly Gin Leu Thr 
290 295 300 

Ala Leu Leu Leu Arg Ser Thr Arg Ala Arg Asn Ala Arg Gin Pro Asp 
305 310 315 320 

Asp He Glu Tyr Thr Ser Leu Thr Thr Ala Gly Leu Leu Tyr Ala Tyr 

325 330 ' " 335 

Ala Val Gly Ser Ser Ala Asp Leu Ala Gin Gin Phe Cys Val Gly Asp 

340 345 * 350 

Asn Lys Tyr Thr Pro Asp Asp Ser Thr Gly Gly Leu Thr Thr Asn Ala 
355 360 " 365 

Pro Pro Gin Gly Arg Asp Val Val Glu Trp Leu Gly Trr> *>he Glu Asp 
370 . 375 3P0 

Gin Asn Arg Lys Pro Thr Pro Asp Met Met Gin Tyr Ala Lys Arg Ala 
385 390 395 " 400 

Val Met Ser Leu Gin Gly Leu Arg Glu Lys Thr lie Gly Lys Tyr Ala 

405 410 " * 415 

Lys Ser Glu Phe Asp Lys 

420 

(2) INFORMATION FOR SEQ ID NO : 3 : ■ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Asp Asn Leu Thr Lys Val Arg Glu Tyr Leu Lys Ser Tyr Ser Arg 
1 5 10 J 15 

Leu Asp Gin Ala Val Gly Glu He Asp Glu He Glu Ala Gin Arg Ala 

20 25 30 

Glu Lys Ser Asn Tyr Glu Leu Phe Gin Glu Asp Gly Val Glu Glu His 
35 40 1 45 

Thr Lys Pro Ser Tyr Phe Gin Ala Ala Asp Asp Ser Asp Thr Glu Ser • 
50 55 60 
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Glu Pro Glu He Glu Asp Asn Gin Gly Leu Tyr Ala Pro Asp Pro Glu 
65 70 75 * 80 

Ala Glu Gin Val Glu Gly Phe He Gin Gly Pro Leu Asp Asp Tyr Ala 

85 90 95 

Asp Glu Glu Val Asp Val Val Phe Thr Ser Asp Trp Lys Gin Pro Glu 

100 105 " " " no 

Leu Glu Ser Asp Glu His Gly Lys Thr Leu Arg Leu Thr Ser Pro Glu 
115 120 " 125 

Gly Leu Ser Gly Glu Gin Lys Ser Gin Trp Leu Ser Thr He Lys Ala 
130 135 140 

Val Val Gin Ser Ala Lys Tyr Trp Asn Leu Ala Glu Cys Thr Phe Glu 
145 150 155 160 

Ala Ser Gly Glu Gly Val He Met Lys Glu Arg Gin He Thr Pro Asp 

165 170 175 

Val Tyr Lys Val Thr Pro Val Met Asn Thr His Pro Ser Gin Ser Glu 

180 185 190 

Ala Val Ser Asp Val Trp Ser Leu Ser Lys Thr Ser Met Thr Phe Gin 
1^5 200 205 

Pro Lys Lys Ala Ser Leu Gin Pro Leu Thr He Ser Leu Asp Glu Leu 
210 215 220 

Phe Ser Ser Arg Gly Glu Phe lie Ser Val Gly Gly Asp Gly Arg Met 
225 230 235 1 240 

Ser His Lys Clu.Ala I?« Leu Leu Gly Leu Arg Tyr Lys Lys Leu Tyr 

-i- • 250 255 

Asn Gin Ala Arg Val Lys Tyr Ser Leu 

260 J 265 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 9 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Ser Leu Lys Lys lie Leu Gly Leu Lys Gly Lys Gly Lys Lys 
15 10 15 

Ser Lys Lys Leu Gly He Ala Pro Pro Pro Tyr Glu Glu Asp Thr Ser 

20 25 30 

Met Glu Tyr Ala Pro Ser Ala Pro lie Asp Lys Ser Tyr Phe Gly Val 
35 40 45 

Asp Glu Met Asp Thr Tyr Asp Pro Asn Gin Leu Arg Tyr Glu Lys Phe 
50 55 60 

Phe Phe Thr Val Lys Met Thr Val Arg Ser Asn Arg Pro Phe Arg Thr 
65 7 ° 75 ^ 80 

Tyr Ser Asp Val Ala Ala Ala Val Ser His Trp Asp His Met Tyr lie 
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85 90 95 

Gly Met Ala Gly Lys Arg Pro Phe Tyr Lys He Leu Ala Phe Leu Gly 

100 105 no 

Ser Ser Asn Leu Lys Ala Thr Pro Ala Val Leu Ala Asp Gin Gly Gin 
115 120 125 

Pro Glu Tyr His Thr His Cys Glu Gly Arg Ala Tyr Leu Pro His Arq 
130 135 140 

Met Gly Lys Thr Pro Pro Met Leu Asn Val Pro Glu His Phe Arg Arg 
145 150 155 160 

Pro Phe Asn He Gly Leu Tyr Lys Gly Thr He Glu Leu Thr Met Thr 

165 170 175 

He Tyr Asp Asp Glu Ser Leu Glu Ala Ala Pro Met He Trp Asp His 

180 185 iso 

Phe Asn Ser Ser Lys Phe Ser Asp Phe Arg Glu Lys Ala Leu Met Phe 
195 200 ~ 205 

Gly Leu He Val Glu Lys Lys Ala Ser Gly Ala Trp Val Leu Asp Ser 
210 215 220 

lie Ser His Phe Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

,(A), LENGTH: 511 amino acids >.y'vr- 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Cys Leu Leu Tyr Leu Ala Phe Leu Phe He Gly Val Asn Cys 
15 io 15 

Lys Phe Thr He Val Phe Pro His Asn Gin Lys Gly Asn Trp Lys Asn 

20 25 ; ' * 30 

Val Pro Ser Asn Tyr His Tyr Cys Pro Ser Ser Ser Asp Leu Asn Trp 
35 40 45 

His Asn Asp Leu He Gly Thr Ala He Gin Val Lys Met Pro Lys Ser 
50 55 60 

His Lys Ala He Gin Ala Asp Gly Trp Met Cys His Ala Ser Lys Trp 
65 70 75 80 

Val Thr Thr Cys Asp Phe Arg Trp Tyr Gly Pro Lys Tyr He Th^ Gin 

85 90 ' 95 

Ser He Arg Ser Phe Thr Pro Ser Val Glu Gin Cys Lys Glu Ser He 

100 105 no 

Glu Gin Thr Lys Gin Gly Thr Trp Leu Asn Pro Gly Phe Pro Pro Gin 
115 120 125 • 

Ser Cys Gly Tyr Ala Thr Val Thr Asp Ala Glu Ala Val lie Val Gin 
130 135 140 
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Val Thr Pro His His Val Leu Val Asp Glu Tyr Thr Gly Glu Trp Val 
145 150 155 160 

Asp Ser Gin Phe lie Asn Gly Lys Cys Ser Asn Tyr lie Cys Pro Thr 

165 170 ' 175 

Val His Asn Ser Thr Thr Trp His Ser Asp Tyr Lys Val Lys Gly Leu 

180 185 190 

Cys Asp Ser Asn Leu lie Ser Met Asp, He Thr Phe Phe Ser Glu Asd 
195 200 205 

Gly Glu Leu Ser Ser Leu Gly Lys Glu Gly Thr Gly Phe Arg Ser Asn 
210 215 ' 220 

Tyr Phe Ala Tyr Glu Thr Gly Gly Lys Ala Cys Lys Met Gin Tyr Cys 
225 230 235 240 

Lys His Trp Gly Val Arg Leu Pro Ser Gly Val Trp Phe Glu Met Ala 

245 250 * 255 

Asp Lys Asp Leu Phe Ala Ala Ala Arg Phe Pro Glu Cys Pro Glu Gly 

260 265 270 

Ser Ser He Ser Ala Pro Ser Gin Thr Ser Val Asp Val Ser Leu He 
275 280 285 

Gin Asp Val Glu Arg He Leu Asp Tyr Ser Leu Cys Gin Glu Thr Trp 
290 295 300 

Ser Lys lie Arg Ala Gly Leu Pro He Ser Pro Val Asp Leu Ser Tyr 
305 310 315 320 

Leu Ala Pro Lys Pan Pro Gx> Thr Gly Pro Ala Phe Thr He He Asn 

325 330 335 

Gly Thr Leu Lys Tyr Phe Glu Thr Arg Tyr lie Arg Val Asp lie Ala 

340 345 350 

Ala Pro lie Leu Ser Arg Met Val Gly Met lie Ser Gly Thr Thr Thr 
355 360 365 

Glu Arg Glu Leu Trp Asp Asp Trp Ala Pro Tyr Glu Asp Val Glu lie 
370 375 380 

Gly Pro Asn Gly Val Leu Arg Thr Ser Ser Gly Tyr Lys Phe Pro Leu 
385 390 395 " ' 400 

Tyr Met He Gly His Gly Met Leu Asp Ser Asp Leu His Leu Ser Ser 

405 410 * 415 

Lys Ala Gin Val Phe Glu His Pro His He Gin Asp Ala Ala Ser Gin 

420 425 430 

Leu Pro Asp Asp Glu Ser Leu Phe Phe Gly Asp Thr Gly Leu Ser Lys 
435 .440 445 

Asn Pro lie Glu Leu Val Glu Gly Trp Phe Ser Ser Trp Lys Ser Ser 
450 455 * 460 

lie Ala Ser Phe Phe Phe lie lie Gly Leu lie lie Gly Leu Phe Leu 
465 470 475 " 480 

Val Leu Arg Val Gly lie His Leu Cys lie Lys Leu Lys His Thr Lys 

485 490 * ' 495 

Lys Arg Gin lie Tyr Thr Asp lie Glu Met Asn Arg Leu Gly Lys 
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500 505 510 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Glu Val His Asp Phe Glu Thr Asp Glu Phe Asn Asp Phe Asn Glu 
1 5 10 15 

Asp Asp Tyr Ala Thr Arg Glu Phe Leu Asn Pro Asp Glu Arg Met Thr 

20 25 30 

Tyr Leu Asn His Ala Asp Tyr Asn Leu Asn Ser Pro Leu lie Ser Asp 
35 40 45 

Asp He Asp Asn Leu He Arg Lys Phe Asn Ser Leu Pro He Pro Ser 
50 55 60 

Met Trp Asp Ser Lys Asn Trp Asp Gly Val Leu Glu Met Leu Thr Ser 
65 70 75 80 

Cys Gin Ala Asn Pro He Ser Thr Ser Gin Met His Lys Trp Met Gly 

85 .90 95 

Ser Trp Leu Met Srtr Asp Asn His Asp Ala Ser Gin Gly Tvr Ser Phe 

100 105 lio 



Leu His Glu Val Asp Lys Glu Ala Glu He Thr Phe Asp Val Val Glu 
115 120 125 

Thr Phe He Arg Gly Trp Gly Asn Lys Pro He Glu Tyr He Lys Lys 
13 0 135 140 

Glu Arg Trp Thr Asp Ser Phe Lys He Leu Ala Tyr Leu Cys Gin Lys 
145 150 155 " 160 

Phe Leu Asp Leu His Lys Leu Thr Leu He Leu Asn Ala Val Ser Glu 

165 170 175 

Val Glu Leu Leu Asn Leu Ala Arg Thr Phe Lys Gly Lys Val Arg Arg 

180 185 " * 190 

Ser Ser His Gly Thr Asn He Cys Arg He Arg Val Pro Ser Leu Gly 
195 200 ~ 205 

Pro Thr Phe He Ser Glu Gly Trp Ala Tyr Phe Lys Lys Leu Asp He 
210 215 220 

Leu Met Asp Arg Asn Phe Leu Leu Met Val Lys Asp Val He He Gly 
225 230 235 240 

• Arg Met Gin Thr Val Leu Ser Met Val Cys Arg He Asp Asn Leu Phe 

245 250 255 

Ser Glu Gin Asp lie Phe Ser Leu Leu Asn He Tyr Arg He Gly Asp 

260 265 " 270 

Lys He Val Glu Arg Gin Gly Asn Phe Ser Tyr Asp Leu He Lys Met 
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275 2B0 285 

Val Glu Pro He Cys Asn Leu Lys Leu Met Lys Leu Ala Arg Glu Ser 
290 295 300 

Arg Pro Leu Val Pro Gin Phe Pro His Phe Glu Asn His He Lys Thr 
305 310 ^ 315 J 320 

Ser Val Asp Glu Gly Ala Lys He Asp Arg Giy He Arg Phe Leu His 

325 330 * " 335 

Asp Gin He Met Ser Val Lys Thr Val Asp Leu Thr Leu Val He Tyr 

340 345 350 

Gly Ser Phe Arg His Trp Gly His Pro Phe He Asp Tyr Tyr Thr Gly 
355 360 365 

Leu Glu Lys Leu His Ser Gin Val Thr Met Lys Lys Asp He Asp Val 
370 375 " 380 

Ser Tyr Ala Lys. Ala Leu Ala Ser Asp Leu Ala Arg He Val Leu Phe 
385 390 395 ~ 400 

Gin Gin Phe Asn Asp His Lys Lys Trp Phe Val Asn Gly Asp Leu Leu 

405 410 ' 415 

Pro His Asp His Pro Phe Lys Ser His Val Lys Glu Asn Thr Trp Pro 

420 425 " 430 

Thr Ala Ala Gin Val Gin Asp Phe Gly Asp Lys Trp His Glu Leu Pro 
435 440 445 

Leu He Lys Cys Phe Glu lie Pro Asp Leu Leu Asp Pro Ser He He 

■ 450 

Tyr Ser Asp Lys Ser His Ser Met Asn Arg Ser Glu Val Leu Lys His 

465 470 475 480 

Val Arg Met Asn Pro Asn Thr Pro lie Pro Ser Lys Lys Val Leu Gin 

485 490 ' 495 

Thr Met Leu Asp Thr Lys Ala Thr Asn Trp Lys Glu Phe Leu Lys Glu 

500 505 sio 

He Asp Glu Lys Gly Leu Asp Asp Asp Asp Leu He lie Gly Leu Lys 
515 520 525 

Gly Lys Glu Arg Glu Leu Lys Leu Ala Gly Arg Phe Phe Ser Leu Met 
530 535 540 

Ser Trp Lys Leu Arg Glu Tyr Phe Val He Thr Glu Tyr Leu lie Lys 
545 550 . 555 560 

Thr His Phe Val Pro Met Phe Lys Gly Leu Thr Met Ala Asp Asp Leu 

565 570 575 

Thr Ala Val He Lys Lys Met Leu Asp Ser Ser Ser Gly Gin Gly Leu 

580 585 590 

Lys Ser Tyr Glu Ala lie Cys lie Ala Asn His lie Asd Tyr Glu Lys 
595 600 605 

Trp Asn Asn His Gin Arg Lys Leu Ser Asn Gly Pro Val Phe Arg Val 
610 615 620 

Met Gly Gin Phe Leu Gly Tyr Pro Ser Leu lie Glu Arg Thr His Glu 
S25 630 635 - ~ 640 
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Phe Phe Glu Lys Ser Leu He Tyr Tyr Asn Gly Arg Pro Asp Leu Met 

645 €50 " 655 

Arg Val Kis Asn Asn Thr Leu He Asn Ser Thr Ser Gin Arq Val Cvs 

660 665 670 

Trp Gin Gly Gin Glu Gly Gly Leu Glu Gly Leu Arg Gin Lys Gly Trp 
675 680 685 

Thr lie Leu Asn Leu Leu Val He Gin Arg Glu Ala Lys He Arg Asn 
690 695 7 00 

Thr Ala Val Lys Val Leu Ala Gin Gly Asp Asn Gin Val He Cys Thr 
705 710 715 7 720 

Gin Tyr Lys Thr Lys Lys Ser Arg Asn Val Val Glu Leu Gin Gly Ala 

725 7 3o 73 £ 

Leu Asn Gin Met Val Ser Asn Asn Glu Lys He Met Thr Ala lie Lys 

740 745 750 

He Gly Thr Gly Lys Leu Gly Leu Leu He Asn Asp Asp Glu Thr Met 
755 760 765 

Gin Ser Ala Asp Tyr Leu Asn Tyr Gly Lys He Pro He Phe Arg Gly 
770 775 780 

Val He Arg Gly Leu Glu Thr Lys Arg Trp Ser Arg Val Thr Cys Val 
785 790 795 eoo 

Thr Asn Asp Gin He Pro Thr Cys Ala Asn lie Met Ser Ser Val Ser 

805 810 B15 

Thr Asn Ala Leu Thr Vcl Ala :-ti £ *. Phe Ala Glu Asn Pro lie A*r= *Ha 

820 825 B3o " V~" 

Met He Gin Tyr Asn Tyr Phe Gly Thr Phe Ala Arg Leu Leu Leu Met 
835 840 845 

Met His Asp Pro Ala Leu Arg Gin Ser Leu Tyr Glu Val Gin Asp Lys 
850 855 ego 

lie Pro Gly Leu His Ser Ser Thr Phe Lys Tyr Ala Met Leu Tyr Leu 
865 870 875 * 8 8o 

Asp Pro Ser He Gly Gly Val Ser Gly Met Ser Leu Ser Arg Phe Leu 

885 890 895 

He Arg Ala Phe Pro Asp Pro Val Thr Glu Ser Leu Ser Phe Trp Aro 

900 9 °5 910 P 9 

Phe He His Val His Ala Arg Ser Glu His Leu Lys Glu Met Ser Ala 
915 920 925 

Val Phe Gly Asn Pro Glu He Ala Lys Phe Arg He Thr His He Asp 
yj0 9 35 940 

Lys Leu Val Glu Asp Pro Thr Ser Leu Asn He Ala Met Gly Met Ser 
45 950 9 55 9 60 

Pro Ala Asn Leu Leu Lys Thr Glu Val Lys Lys Cys Leu He Glu Ser 

965 9 70 " 975 

Arg Gin Thr He" Arg Asn Gin Val He Lys Asp Ala Thr He Tyr Leu 

98 0 985 990 

Tyr His Glu Glu Asp Arg Leu Arg Ser Phe Leu Trp Ser He Asn Pro 



- 95 - 



BNSDOCID: <WO. 



S634625A1_I_> 



WO 96/34625 PCTYUS96/06053 

995 1000 1005 

Leu Phe Pro Arg Phe Leu Ser Glu Phe Lys Ser Gly Thr Phe Leu Gly 
1010 1015 ' 1020 

Val Ala Asp Gly Leu He Ser Leu Phe Gin Asn Ser Arg Thr lie Arg 
1025 / 1030 .1035 ~ 1040 

Asn Ser Phe Lys Lys Lys Tyr His Arg Glu Leu Asp Asp Leu He Val 

1045 1050 1055 

Arg Ser Glu Val Ser Ser Leu Thr His Leu Gly Lys Leu His Leu Arg 

1060 1065 * 1070 

Arg Gly Ser Cys Lys Met Trp Thr Cys Ser Ala Thr His Ala Asp Thr 
1075 1080 1085 

Leu Arg Tyr Lys Ser Trp Gly Arg Thr Val He Gly Thr Thr Val Pro 
1090 1095 1100 

His Pro Leu Glu Met Leu Gly Pro Gin His Arg Lys Glu Thr Pro Cys 
1105 1110 1115 ' 1120 

Ala Pro Cys Asn Thr Ser Gly Phe Asn Tyr Val Ser Val His Cys Pro 

1125 1130 1135 

Asp Gly He His Asp Val Phe Ser Ser Arg Gly Pro Leu Pro Ala Tyr 

1140 1145 * . 1150 

Leu Gly Ser Lys Thr Ser Glu Ser Thr Ser lie Leu Gin Pro Trp Glu 
1155 1160 1165 

Arg Glu Ser Lys Val Pro Leu He Lys Arg Ala Thr Arg Leu Arg Asp 
1170 3 175 1180 

Ala He Ser Trp Phe Val Glu Pro Asp Ser Lys Leu Ala Met Thr He 
1185 1190 1195 1200 

Leu Ser Asn He His Ser Leu Thr Gly Glu Glu Trp Thr Lys Arg Gin 

1205 1210 1215 

His Gly Phe Lys Arg Thr Gly Ser Ala Leu His Arg Phe Ser Thr Ser 

1220 1225 1230 

Arg Met Ser His Gly Gly Phe Ala Ser Gin Ser Thr Ala Ala Leu Thr 
1235 1240 1245 

Arg Leu Met Ala Thr Thr Asp Thr Met Arg Asp Leu Gly Asp Gin Asn 
1250 1255 1260 

Phe Asp Phe Leu Phe Gin Ala Thr Leu Leu Tyr Ala Gin He Thr Thr 
1265 1270 1275 1280 

Thr Val Ala Arg Asp Gly Trp He Thr Ser Cys Thr Asp His Tyr His 

1285 1290 ' * 1295 

He Ala Cys Lys Ser Cys Leu Arg Pro lie Glu Glu He Thr Leu Asp 

1300 1305 1310 

Ser Ser Met Asp Tyr Thr Pro Pro Asd Val Ser His Val Leu Lys Thr 
1315 1320 * . 1325 

Trp Arg Asn Gly Glu Gly Ser Trp Gly Gin Glu He Lys Gin He Tyr 
1330 1335 1340 

Pro Leu Glu Gly Asn Trp Lys Asn Leu Ala Pro Ala Glu Gin Ser Tyr 
1345 1350 1355 1360 
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Gin Val Gly Arg Cys He Gly Phe Leu Tyr Gly Asp Leu Ala Tyr Arg 

1365 137 0 " ~ 1375 

Lys Ser Thr His Ala Glu Asp Ser Ser Leu Phe Pro Leu Ser He Gin 

1380 1385 1390 

Gly Arg He Arg Gly Arg Gly Phe Leu Lys Gly Leu Leu Asp Gly Leu 
1395 1400 1405 

Met Arg Ala Ser Cys Cys Gin Val He His Arg Arg Ser Leu Ala His 
1410 1415 1420 

Leu Lys Arg Pro Ala Asn Ala Val Tyr Gly Gly Leu He Tyr Leu He 
1425 1430 1435 * 1440 

Asp Lys Leu Ser Val Ser Pro Pro Phe Leu Ser Leu Thr Arg Ser Gly 

1445 1450 1455 

Pro He Arg Asp Glu Leu Glu Thr He Pro His Lys He Pro Thr Ser 

1460 ■ 1465 * 1470 

Tyr Pro Thr Ser Asn Arg Asp Met Gly Val lie Val Arg. Asn Tyr Phe 
1475 1480 1485 

Lys Tyr Gin Cys Arg Leu He Glu Lys Gly Lys Tyr Arg Ser His Tyr 
1490 1495 1500 

Ser Gin Leu Trp Leu Phe Ser Asp Val Leu Ser lie Asp Phe lie Gly 
1505 1510 1515 " 1520 

Pro Phe Ser He Ser Thr Thr Leu Leu Gin He Leu Tyr Lys Pro Phe 

1525 1530 1535 

Leu Ser Gly Lys Asp Lys Asn Glu T eu Arg Glu Leu Ala Asn Leu Ser " 

1540 1545 1550 " • *■ 

Ser Leu Leu Arg Ser Gly Glu Gly Trp Glu Asp He His Val Lys Phe 
1555 1560 1565 

Phe Thr Lys Asp He Leu Leu Cys Pro Glu Glu He Arg His Ala Cys 
1570 1575 1560 

Lys Phe Gly He Ala Lys Asp Asn Asn Lys Asp Met Ser Tyr Pro Pro 
1585 1590 1595 1600 

Trp Gly Arg Glu Ser Arg Gly Thr lie Thr Thr lie Pro Val Tyr Tyr 

1605 ' 1610 1615 

Thr Thr Thr Pro Tyr Pro Lys Met Leu Glu Met Pro Pro Arg lie Gin 

1620 1625 1630 

Asn Pro Leu Leu Ser Gly He Arg Leu Gly Gin Leu Pro Th^ Gly Ala 
1635 1640 " 1645 

His Tyr Lys He Arg Ser He Leu His Gly Met Gly lie His Tyr Arg 
1650 - 1655 1660 

Asp Phe Leu Ser Cys Gly Asp Gly Ser Gly Gly Met Thr Ala Ala Leu 
1665 1670 1675 1680 

Leu Arg Glu Asn Val His Ser Arg Gly lie Phe Asn Ser Leu Leu Glu 

1685 1690 1695 

Leu Ser Gly Ser Val Met Arg Gly Ala Ser Pro Glu Pro Pro Ser Ala 

1700 1705 1710 

Leu Glu Thr Leu Gly Gly Asp Lys Ser Arg Cys Val Asn Gly Glu Thr 

- 97 - 



BNSDOCID: <WO 9634625A 1 _l_> 



WO 96/34625 



PCT/US96/06053 



1715 • 1720 1725 

• Cys Trp Glu Tyr Pro Ser Asp Leu Cys Asp Pro Arg Thr Trp Asp Tyr 
1730 1735 * 1740 

Phe Leu Arg Leu Lys Ala Gly Leu Gly Leu Gin lie Asp Leu lie Val 
1745 1750 ^ 1755 1760 

Met Asp Met Glu Val Arg Asp Ser Ser Thr Ser Leu Lys lie Glu Thr 

1765 1770 1775 

Asn Val Arg Asn Tyr Val His Arg lie Leu Asp Glu Gin Gly Val Leu 

1780 1785 1790 

lie Tyr Lys Thr Tyr Gly Thr Tyr He Cys Glu Ser Glu Lys Asn Ala 
1795 1800 1805 

Val Thr He Leu Gly Pro Met Phe Lys Thr Val Asp Leu Val Gin Thr 
1810 1815 1820 

Glu Phe Ser Ser Ser Gin Thr Ser Glu Val Tyr Met Val Cys Lys Gly 
1825 1830 1635 ~ 1840 

Leu Lys Lys Leu He Asp Glu Pro Asn Pro Asp Trp Ser Ser He Asn 

1845 1850 " " 1855 

Glu Ser Trp Lys Asn Leu Tyr Ala Phe Gin Ser Ser Glu Gin Glu Phe 

1860 1865 1870 

Ala Arg Ala Lys Lys Val Ser Thr Tyr. Phe Thr Leu Thr Gly He Pro 
1875 1880 1885 

Ser Gin Phe He Pro Asp Pro Phe Val Asn He Glu Thr Met Leu Gin 
1890 1895 1900 

He Phe Gly Val Pro Thr Gly Val Ser His Ala Ala Ala Leu Lys Ser 
1905 1910 1915 ' 1920 

Ser Asp Arg Pro Ala Asp Leu Leu Thr He Ser Leu Phe Tyr Met Ala 

1925 1930 * 1935 

He He Ser Tyr Tyr Asn He Asn His He Arg Val Gly Pro He Pro 

1940 1945 ~ 1950 

Pro Asn Pro Pro Ser Asp Gly He Ala Gin Asn Val Gly He Ala He 
1955 I960 1965 

Thr Gly He Ser Phe Trp Leu Ser Leu Met Glu Lys Asp He Pro Leu 
1970 1975 1980 

Tyr Gin Gin Cys Leu Ala Val He Gin Gin Ser Phe Pro He Arg Trp 
1985 1990 1995 ■ 2000 

Glu Ala Val Ser Val Lys Gly Gly Tyr Lys Gin Lys Trp Ser Thr Arg 

2 005 2 010 * 2 015 

Gly Asp Gly Leu Pro Lys Asp Thr Arg Thr Ser Asp Ser Leu Ala Pro 

2020 2025 " 2030 

He Gly Asn Trp He Arg Ser Leu Glu Leu Val Arg Asn Gin Val Arg 
2035 2040 2045 

Leu Asn Pro Phe Asn Glu He Leu Phe Asn Gin Leu Cys Arc Thr Val 
2050 2055 2060 

Asp Asn His Leu Lys Trp Ser Asn Leu Arg Arg Asn Thr Gly Met lie 
2065 2070 2075 2080 
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Glu Trp He Asn Arg Arg He Ser Lys Glu Asp Arg Ser He Leu Met 

2 °85 2090 " 2095 

Leu Lys Ser Asp Leu His Glu Glu Asn Ser Trp Arg Asp 

2100 2105 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) .MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 : 



GCACTTTT CG 


GGGAAATGTG 


CGCGGAACCC 


CTATTTGTTT 


ATTTTTCTAA 


ATACATT CAA 


fin 


ATATGTATCC 


GCTCATGAGA 


CAATAACCCT 


GATAAATGCT 


T C AATAAT AT 


TG AAAAAGCi A 




AGAGTATGAG 


TATTCAACAT 


TTCCGTGTCG 


CCCTTATTCC 


CTTTTTTGCG 


GCATTTTGCC 


1 on 


TTCCTGTTTT 


TGCTCACCCA 


GAAACGCTGG 


TGAAAGTAAA 


AGATGCTGAA 


GATCAGTTCSn 


z. u 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


TAAGATCCTT 


GAGAGTTTTP 


inn 


GCCCCGAAGA ACGTTTTCCA ATGATGAGCA 


CTTTTAAAGT 


/*^fT>/^ ^^rT>*7\ fT>^*fT^ 

I CTGCTATGT 


GGCGCGGTAT 


360 


TATCCCGTAT 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


CATACACTAT 


:/ : '*TCAG£A.TG. 




ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


ACAGTAAGAG 


480 


AATTATGCAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


GGCCAACTTA 


CTTCTGACAA 


54 0 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCGCTT 


TTTTGCACAA 


CATGGGGGAT 


CATGTAACTC 


600 


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


AAGCCATACC 


AAACGACGAG 


CGTGACACCA 


660 


CGATGCCTGT 


AGCAATGGCA ACAACGTTGC GCAAACTATT 


AACTGGCGAA 


CTACTTACTC 


720 


TAGCTTCCCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


GGACCACTTC 


780 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


ATCTGGAGCC 


GGTGAGCGTG 


840 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


GCCCTCCCGT 


ATCGTAGTTA 


900 


TCTACACGAC 


GGGGAGTCAG 


GCAACTATGG 


ATGAACGAAA 


TAGACAGATC 


GCTGAGATAG 


960 


GTGCCTCACT 


GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


TTACTCATAT 


ATACTTTAGA 


1020 


TTGATTTAAA 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


GAAGATCCTT 


TTTGATAATC 


1080 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


AGCGTCAGAC 


CCCGTAGAAA 


1140 


AGATCAAAGG 


ATCTTCTTGA 


GATCCTTTTT 


TTCTGCGCGT 


AATCTGCTGC 


TTGCAAACAA 


1200 


AAAAACCACC 


GCTACCAG CG 


GTGGTTTGTT 


TGCCGGATCA 


AGAGCTACCA 


ACTCTTTTTC 


1260 


CGAAGGTAAC 


TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


TGTCCTTCTA 


GTGTAGCCGT 


1320 


AGTTAGGCCA 


CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


ATACCTCGCT 


CTGCTAATCC 


1380 
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TGTTACCAGT 

GATAGTTACC 

GCTTGGAGCG 

CCACGCTTCC 

GAGAGCGCAC 

TTCGCCACCT 



GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC 
GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA 
AACGACCTAC ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG 
CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 
GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT 
CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 
GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT 



GAGCTGATAC 
CGGAAGAGCG 



CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG 
CCCAATACGC AAACCGCCTC TCCCCGCGCG TTGGCCGATT CATTAATGCA 
GCTGGCACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGCAACGCA ATTAATGTGA 
GTTAGCTCAC TCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT 
GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT GATTACGCCA 
AGCTCGAAAT TAACCCTCAC TAAAGGGAAC AAAAGCTGGA GCTCCACCGC ACTAGTATCG 
AGGTCTCGAT CCGGATATAG TTCCTCCTTT CAGCAAAAAA CCCCTCAAGA CCCGTTTAGA 
GGCCCCAAGG GGTTATGCTA GTTATTGCTC AGCGGTGGCA GCAGCCAACT CAGCTTCCTT 
TCGGGCTTTG TTAGCAGCCC CGCGGGGGCC CCTCCCTTAG CCATCCGAGT GGACGACGTC 
CTCCTTCGGA TGCCCAGGTC GGACCGCGAG GAGGTGGAGA TGCCATGCCG ACCCACGAAG 
ACCACAAAAC CAGATAAAAA ATAAAAACCA CAAGAGGGTC TTAAGGATCA AAGTTTTTTT 
CATACTTAAA GTTTGGAGTC TCCTCATGAT TTTTTAATCT CTCCAAGAGT TTTCCTCGTG 
TAGGTCACTC TTCAACATCA GTATAGACCG GTCTTCTTTT GAAATTCGTC TATTGATCCA 
TTCAATCATT CCTGTGTTTC TTCGCAAATT TGACCATTTC AAATGATTAT CCACTGTACG 
ACATAGCTGA TTGAACAAGA TCTCATTGAA TGGATTTAGA CGAACTTGGT TTCGGACCAA 
TTCCAGAGAT CTGATCCAGT TCCCGATTGG GGCCAAGGAG TCTGAAGTTC GGGTATCTTT 
TGGGAGCCCA TCACCTCTAG TACTCCACTT CTGCTTGTAT CCTCCTTTTA CTGAAACAGC 
CTCCCACCTA ATCGGGAATG ATTGCTGGAT AACTGCTAAA CACTGTTGAT ATAGTGGAAT 



GTCTTTCTCC 
TGCAATTCCA 



ATCAAACTCA GCCAAAAGCT TATACCAGTT 
TCTGATGGGG GGTTCGGAGG TATCGGTCCT 



ATAGCGATCC CCACATTTTG 
ACTCTGATAT GATTGATGTT 



ATAATACGAT ATAATCGCCA TATAAAAAAG GCTAATGGTC AATAAATCTG CAGGTCTATC 
AGATGATTTT AAGGCAGCCG CATGAGACAC ACCCGTGGGT ACTCCGAATA TTTGTAGCAT 



AGTCTCAATG 
GTATGTACTA 
GTTTTTCCAG 
CAAACCTTTA 



TTTACAAAAG GATCAGGAAT GAATTGGGAG 
ACCTTCTTTG CTCTGGCAAA TTCCTGTTCT 
GATTCATTGA TGGAAGACCA ATCGGGATTG 
CATACCATAT ATACTTCAGA CGTTTGAGAA 



GGAATACCTG TCAAGGTAAA 
GATGACTGGA ATGCGTACAG 
GGTTCATCGA TTAATTTCTT 
CTACTAAATT CTGTTTGAAC 



1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

"?*O0 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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1 A •) 



TAAGTCGACC GTCTTGAACA TGGGACCAAG GATTGTTACT GCATTCTTTT CGCTCTCACA 
AATATATGTT CCATAAGTCT TGTAGATTAA AACTCCTTGC TCATCCAAAA TCCGGTGCAC 
ATAATTTCTA ACATTCGTCT CAATTTTCAG G CTAGTAGAA GAATCCCGAA CTTCCATATC 
CATTACAATT AAATCAATTT GAAGCCCCAA GCCTGCTTTG AGTCGGAGGA AATAGTCCCA 
AGTCCTTGGG TCACATAAGT CAGATGGATA TTCCCAACAT GTTTCACCAT TTACACATCT 
CGATTTATCT CCTCCTAAAG TTTCTAGGGC ACTGGGGGGC TCAGGAGAGG CGCCTCGCAT 
GACTGACCCT GATAATTCTA ACAGACTATT GAATATTCCT CTGCTATGCA CATTTTCTCG 
TAGTAATGCA GCAGTCATCC CTCCGGAGCC GTCTCCACAA CTCAAGAAGT CCCTGTAATG 
GATTCCCATT CCATGTAATA TACTCCGAAT TTTATAATGA GCGCCAGTTG GTAATTGGCC 
CAACCTGATT CCGGACAGCA GGGGATTTTG GATTCTTGGA GGCATCTCTA GCATCTTTGG 
GTAAGGGGTG GTCGTATAAT AAACAGGGAT TGTTGTAATT GTCCCTCTGG ATTCCCTTCC 
CCAAGGGGGA TAGCTCATGT CTTTATTATT ATCCTTAGCA ATCCCGAACT TGCAAGCATG 
TCTGATTTCC TCTGGACACA ATAATATGTC CTTGGTGAAG AATTTCACAT GTATGTCTTC 
CCACCCCTCT CCTGATCTTA GCAATGAAGA AAGATTTGCC AGCTCTCTCA ACTCATTCTT 
ATCTTTCCCA GATAAAAATG GCTTGTATAG GATTTGCAAG AGGGTGGTGG AAATAGAGAA 
TGGTCCAATG AAGTCTATGG ATAAGACATC TGAGAATAAC CATAATTGTG AATAATGTGA 
TCTGTATTTT CCCTTTTCAA TTAGACGGCA TTGGTATTTG AAGTAATTTC TGACAATCAC 
CCCCATATCA CGGTTGCTTG TCGGATAGGA GGTTGGGATC TTGTGGGGAA TCGTTTCTAA 
TTCGTCTCTA ATAGGTCCTG ATCTAGTAAG AGAAAGGAAT GGAGGTGATA CACTCAATTT 
ATCAATCAAG TAAATCAAAC CTCCGTACAC TGCGTTGGCC GGCCTCTTCA AATGAGCCAG 
ACTTCTCCGG TGTATTACTT GGCAGCAACT TGCTCTCATT AATCCGTCTA GCAACCCTTT 
TAAGAAACCT CGACCTCTAA TACGACCTTG TATAGATAGA GGAAATAGAG AACTGTCCTC 
GGCATGAGTA GATTTTCTAT ACGCCAAGTC TCCATATAGA AAACCTATAC ATCTGCCGAC 
TTGATAGGAT TGCTCAGCAG GTGCTAAATT CTTCCAATTC CCTTCTAAAG GATAGATCTG ' 
TTTTATCTCT TGTCCCCACG AACCTTCCCC ATTCCTCCAT GTCTTCAGCA CATGGGATAC 
ATCTGGGGGC GTGTAGTCCA TACTTGAGTC CAGGGTGATC TCTTCTATGG GTCTCAAACA 
GGACTTACAG GCAATATGAT AATGATCTGT ACAACTGGTG ATCCATCCGT CTCTTGCAAC 
AGTGGTGGTA ATTTGAGCAT AGAGCAACGT TGCTTGGAAT AAAAAGTCGA AATTCTGATC 
TCCCAGATCC CTCATGGTGT CTGTAGTTGC CATCAACCTG GTCAATGCTG CAGTGCTCTG 
AGATGCGAAC CCACCATGGC TCATCCGAGA TGTCGAAAAC CTATGAAGGG CAGACCCTGT 
TCTTTTGAAC CCATGCTGCC TTTTGGTCCA TTCTTCGCCT GTTAAAGAGT GGATGTTAGA 
AAGTATAGTC ATTGCTAGTT TAGAGTCGGG TTCAACAAAC CAAGAGATAG CATCTCTAAG 
ACGTGTAGCT CTTTTAATCA GTGGGACTTT GCTTTCCCTT TCCCAAGGCT GCAAAATAGA 
TGTAGATTCA GATGTTTTAG ACCCTAGATA AGCAGGCAAT GGTCCCCGTG AACTAAAGAC 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 
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5160 

5220 

5280 

5340 

5400 
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GTCATGGATC CCG TCTGGAC AATGCACAGA AACATAATTG AACCCTGATG TGTTACATGG 
TGCACAAGGA GTCTCTTTTC GATGTTGTGG ACCCAACATT TCTAATGGAT GGGGTACAGT 
TGTCCCAATA ACTGTACGGC CCCAGGATTT GTATCTTAAT GTGTCAGCAT GAGTAGCTGA 
ACATGTCCAC ATTTTACATG ATCCCCTTCT CAAATGAAGT TTCCCTAAAT GTGTCAAAGA 
GGATACCTCA CTCCTCACAA TCAAATCATC CAATTCCCTA TGATACTTTT TCTTAAAGGA 
GTTCCGAATA GTACGAGAAT TTTGAAATAG ACTGATGAGC CCGTCTGCGA CTCCCAAAAA 
AGTGCCTGAT TTGAATTCAC TTAAAAATCT AGGGAACAGA GGATTTATTG ACCATAAGAA 
ACTTCTGAGC CGATCCTCTT CATGATACAA ATATATGGTT GCATCCTTAA TCACCTGGTT 
CCTGATGGTT TGTCTTGATT CGATTAAGCA TTTTTTAACC TCAGTCTTTA ACAAGTTCGC 
TGGACTCATT CCCATAGCGA TGTTCAGAGA GGTTGGATCT TCTACTAGCT TGTCTATGTG 
AGTTATTCGA AACTTGGCTA TCTCGGGGTT TCCAAATACT GCACTCATCT CCTTCAGATG 
CTCACTTCGA GCATGTACAT GGATGAATCT CCAGAATGAG AGACTTTCTG TTACGGGATC 
TGGGAAGGCT CTAATCAAAA ACCTGGACAA AGACATGCCC GACACTCCTC CAATGGAAGG 
GTCCAAATAC AACATGGCGT ATTTGAAAGT AGAACTGTGC AAGCCCGGTA TCTTATCTTG 
AACTTCATAC AATGATTGAC GAAGAGCAGG ATCATGCATC ATCAACAAGA GTCTAGCAAA 
TGTCCCAAAA TAATTGTACT GTATCATGGC " ATTGATTGGG TTCTCAG CAA AATGAGCTAC 
GGTGAGAGCA TTTGTGGAAA CTGAGCTCAT TATATTAGCA CAAGTGGGTA TTTGGTCATT 
GGTGACACAA GTCACTCGTG .ACCATCTCTT GGTCTCTAAC CCTCTAATCA CTCCACGGAA 
AATCGGTATT TTTCCATAAT TCAAGTAATC TGCAGATTGC ATAGTCTCAT CGTCATTTAT 
CAAAAGTCCT AACTTCCCTG TCCCTATTTT GATTGCAGTC ATAATTTTCT CATTATTAGA 
AACCATTTGA TTGAGAGCAC CCTGTAATTC TACAACGTTT CTCGATTTCT TCGTTTTATA 
CTGTGTGCAA ATAACTTGAT TATCACCTTG TGCCAAGACT TTGACAGCAG TGTTTCTGAT 
TTTAGCCTCT CTTTGAATAA C CAGTAG ATT GAGGATAGTC CATCCTTTTT GCCGTAGACC 
TTCCAGTCCA CCCTCTTGTC CTTGCCAACA AACTCGTTGG GAGGTTGAAT TGATCAGTGT 
GTTGTTGTGA ACACGCATCA AGTCTGGTCT TCCATTGTAG TATATAAGAC TTTTCTCAAA 
AAATTCATGA GTTCTCTCGA TTAAGGATGG ATAACCTAAG AACTGGCCCA TAACTCGGAA 
CACTGGGCCG TTTGATAACT TCCTTTGGTG GTTATTCCAT TTTTCGTAAT CAATGTGATT 
GGCTATGCAA ATTGCCTCAT ATGACTTCAA TCCTTGGCCG GATGAG GAAT CTAACATCTT 
TTTAATGACT GCAGTTAGAT CGTCCGCCAT TGTCAGGCCT TTAAACATAG GGACGAAATG 
AGTCTTTATC AAATATTCGG TAATTACAAA GTATTCTCGC AATTTCCAAG ACATTAGGGA 
GAAAAATCTA CCTGCCAACT TCAGTTCCCT CTCCTTTCCT TTAAGACCAA TAATTAGATC 
ATCATCATCT AAGCCCTTCT CATCAATCTC TTTAAGAAAT TCTTTCCAAT TGGTAGCCTT 
TGTGTCCAAC ATAGTCTGCA ACACCTTTTT ACTAGGGATA GGAGTGTTCG GATTCATTCG 
GACATGTTTC AACACCTCTG ACCTATTCAT TGAATGACTT TTGTCAGAGT ATATTATCGA 
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TGGGTCTAGT AAGTCGGGTA TTTCAAAACA TTTAATCAGC GGAAGTTCAT GCCATTTATC 
TCCAAAATCT TGAACTTGAG CAGCTGTGGG CCATGTATTT TCTTTAACAT GACTTTTAAA 
GGGATGATCA TGAGGGAGCA AGTCTCCATT CACGAACCAC TTTTTATGAT CATTGAACTG 
TTGAAATAGA ACAATCCGAG CTAAATCACT TGCAAGTGCT TTTG CATATG ACACATCAAT 

« 

ATCTTTCTTC ATGGTTACTT GGGAATGTAA TTTTTCTAGT CCAGTGTAAT AATCTATAAA 
AGGATGACCC CAATGTCTGA ACGATCCATA AATCACCAGT GTGAGATCCA CTGTTTTCAC 
ACTCATTATC TGATCATGGA GGAATCTTAT ACCTCGGTCA ATTTTTGCCC CTTCATCAAC 
AGAAGTCTTG ATATGATTTT CAAAATGAGG GAATTGTGGG ACTAAAGGCC TTGATTCTCT 
TGCTAATTTC ATCAGCTTCA AGTTGCATAT CGGTTCCACC ATTTTAATCA AGTCATAAGA 
AAAATTTCCC TGCCTCTCCA CAATTTTATC TCCAATTCTG TAGATATTTA GAAGGGAGAA 
GATGTCTTGC TCTGAGAACA GGTTGTCTAT TCTACATACC ATGGATAGCA CCGTTTGCAT 
CCTCCCTATA ATCACATCTT TGACCATTAA CAGAAAGTTT CGGTCCATTA GAATATCAAG 
TTTCTTGAAG TAAGCCCATC CTTCTGAAAT AAAAGTAGGA CCCAAGCTGG GAACCCTAAT 
CCTGCATATG TTCGTTCCAT GAGAACTTCT TCTGACTTTG CCTTTGAAAG TCCTCGCCAA 
GTTGAGCAAT TCCACCTCAG AGACAGCATT TAAGATTAAT GTCAACTTGT GTAAGTCCAA 
AAACTTTTGA CACAAATAAG CGAGAATTTT GAATGAGTCA GTCCATCTTT CCTTTTTGAT 
GTATTCAATT GGTTTGTTGC CCCAGCCGCG GATGAAGGTC TCCACCACGT CAAATGTTAT 
..JTCTGCCTCT TTGTCCACTT CATGTAAAAA ACTATACCCT TGACTGGCAT CATGATTATC 
AGACATTAAC CAACTTCCCA TCCATTTATG CATCTGAGAT GTTGAGATGG GATTGGCTTG 
ACATGATGTT AACATCTCAA GAACTCCATC CCAGTTCTTA CTATCCCACA TCGAGGGAAT 
CGGAAGAGAA TTGAATTTCC TGATCAAATT GTCAATATCA TCACTAATTA GAGGAGAATT 
CAAATTGTAA TCAGCATGAT TCAAGTACGT CATGCGCTCA TCGGGATTCA GGAATTCTCT 
TGTGGCATAG TCATCTTCAT TGAAATCATT GAACTCGTCG GTCTCAAAAT CGTGGACTTC 
CATGATTGCT GTTAGTTTTT TTCATAAAAA TTAAAAACTC AAATATAATT GAGGCCTCTT 
TGAGCATGGT ATCACAAGTT GATTTGGTCC AAACATGAAG AATCTGGCTA GCAGGATTTG 
AGTTACTTTC CAAGTCGGTT CATCTCTATG TCTGTATAAA TCTGTCTTTT CTTGGTGTGC 
TTTAATTTAA TGCAAAGATG GATACCAACT 
CCTATGATAA AGAAAAAAGA GGCAATAGAG 



CGGAGAACCA AGAATAGTCC AATGATTAAC 
CTTTTCCAAC TACTGAACCA ACCTTCTACA 



AGCTCGATTG GATTTTTGGA TAG CC CAGTA TCACCAAAAA ATAAACTCTC ATCATCAGGA 
AGTTGCGAAG CAGCGTCTTG AATGTGAGGA TGTTCGAACA CCTGAGCCTT TGAG CTAAGA 
TGAAGATCGG AGTCCAACAT ACCATGTCCA ATCATGTATA AAGGAAACTT ATATCCTGAA 
CTGGTCCTCA GAACTCCATT GGGTCCAATT TCCACGTCTT CATATGGTGC CCAGTCATCC 
CACAGTTCCC TTTCTGTGGT AGTTCCACTG ATCATTCCGA CCATTCTTGA GAGGATTGGA 
GCAGCAATAT CGACTCTGAT GTATCTGGTC TCAAAGTATT TTAGGGTACC ATTGATTATG 
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GTGAAAGCAG GACCGGTTCC TGGGTTTTTA GGAGCAAGAT AGCTGAGATC CACTGGAGAG 
ATTGGAAGAC CCGCTCTGAT TTTGCTCCAG GTTTCTTGGC AGAGGGAATA ATCCAAGATC 
CTCTCAACGT CCTGAATTAG ACTTACATCC ACTGAGGTCT GAGATGGAGC AGAGATACTT 
GACCCTTCTG GGCATTCAGG GAATCTGGCT GCAGCAAAGA GATCCTTATC AGCCATCTCG 
AACCAGACAC CTGATGGGAG TCTGACTCCC CAATGCTTGC AGTATTGCAT TTTGCAGGCC 
TTGCCTCCAG TTTCATAAGC AAAGTAGTTA CTTCTGAACC CTGTGCCCTC CTTTCCCAGG 
GATGATAGCT CTCCGTCCTC TGAGAAGAAG GTGATGTCCA TGGAAATGAG GTTAGAATCA 
CATAGCCCTT TGACCTTATA GTCAGAATGC CAGGTTGTAG AGTTATGGAC AGTGGGGCAT 
ATGTAATTGC TGCATTTTCC GTTGATGAAC TGTGAATCAA CCCATTCTCC TGTGTATTCA 
TCAACCAGCA CATGGTGAGG AGTCACCTGG ACAATCACTG CTTCGGCATC CGTCACAGTT 
GCATATCCAC AACTTTGAGG AGGGAAGCCT GGATTCAGCC AAGTTCCTTG TTTCGTTTGT 
TCAATGCTTT CCTTGCATTG TTCTACAGAT GGAGTGAAGG ATCGGATGGA CTGTGTTATA 
TACTTCGGTC CATACCAGCG GAAATCACAA GTAGTGACCC ATTTGGAAGC ATGACACATC 
CAACCGTCTG CTTGAATAGC CTTGTGACTC TTGGGCATTT TGACTTGTAT GGCTGTGCCT 
ATTAAGTCAT TATGCCAATT TAAATCTGAG CTTGACGGGC AATAATGGTA ATTAGAAGGA 
ACATTTTTCC AGTTTCCTTT TTGGTTGTGT GGAAAAACTA TGGTGAACTT GCAATTCACC 
CCAATGAATA AAAAGGCTAA GTACAAAAGG CACTTCATAG TGACGCGTAA ACAGATCGAT 
CTCTGTTAGT TTTTTTCATA GGGATAGAAA AGACAGGATA TTAGTTGTTC GAGAGGCTGG 
AATTAGGAGA GACTGAGTAA ACCGGGGATT GTTCAGAAGC TAGAAGTTAG ACTAGCTGAT 
TTGAAGTGGC TGATAGAATC CAGGACCCAC GCTCCAGATG CCTTTTTCTC GACAATCAGG 
CCAAACATTA AGGCCTTCTC TCTGAAATCA GAAAATTTGG AAGAATTGAA ATGATCC CAG 
ATCATAGGAG CTGCTTCCAG TGACTCATCA TCGTAGATGG TCATTGTGAG CTCAATCGTT 
CCCTTGTAAA GACCTATATT GAATGGTCTT CTGAAGTGCT CTGGTACATT GAGCATGGGA 
GGGGTCTTCC CCATCCTATG TGGCAAATAA GCCCTGCCTT CGCAGTGAGT GTGATACTCT 
GGTTGACCTT GATCTGCCAA TACCGCTGGA GTGGGCTTTA GATTAGAAGA ACCCAAAAAA 
GCCAAGATTT TGTAGAAGGG ACGTTTCCCT GCCATTCCGA TGTACATGTG ATCCCAATGG 
GATACAGCGG CTGCCACATC TGAGTATGTT CTGAACGGAC GATTAGATCT AACCGTCATT 
TTCACTGTAA AGAAGAATTT CTCATATCTT AATTGATTCG GATCATAGGT GTCCATCTCG 
TCAACTCCAA AATAGGATTT GTCAATTGGA GCGCTCGGAG CATACTCCAT GCTAGTGTCC 
TCTTCATAAG GGGGTGGTGC GATCCCTAAT TTCTTAGATT TCTTAGCTTT CCCCTTCAGA 
CCGAGAATCT TCTTTAAGGA ACT CATGATG AATGGATTGG GATAACACTT AGATCGTGAT 
ATCTGTTACT TTTTTTCATA GTCTACAGAG AATATTTGAC TCTCGCCTGA TTGTACAACT 
TTTTGTATCT CAGGCCGAGC AGGATGGCCT CTTTATGAGA CATTCGTCCG TCACCTCCGA 
CAGAGATGAA CTCTCCTCTA GAT GAGAAC A ATTCATCCAA GGATATGGTG AGAGGCTGAA 
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GACTTGCTTT CTTGGGTTGG AAAGTCATGG ATGTCTTTGA GAG AG AC C AA ACATCTGATA 11640 

CTGCTTCTGA TTGGGACGGA TGTGTGTTCA TCACTGGAGT GACCTTATAT ACATCCGGAG 11700 

TTATCTGGCG CTCCTTCATA ATGACCCCTT CTCCCGATGC TTCAAATGTG CACTCTGCCA 11760 

GATTCCAGTA TTTGGCACTT TGCACGACTG CTTTAATCGT CGAAAGCCAC TGGGATTTCT 11820 

GCTCTCCACT TAAACCCTCT GGCGATGTCA ACCGTAAGGT CTTTCCATGC TCGTCAGATT 11880 

CAAGCTCAGG CTGTTTCCAG TCCGAAGTAA ATACAACATC CACTTCCTCA TCTGCATAGT 11940 

CATCTAAAGG CCCCTGTATA AAGCCTTCAA CTTGCTCAGC TTCTGGATCT GGTGCATACA 12 000 

AACCTTGATT GTCTTCAATT TCTGGTTCAG ATTCTGTGTC AGAATCATCT GCTGCCTGAA 12 060 

AATAAGAGGG CTTAGTATGC TCTTCCACTC CATCCTCTTG GAACAACTCA TAATTGGACT 12120 

TTTCAGCTCG TTGTGCTTCG ATCTCATCTA TCTCTCCTAC CGCCTGATCC AGACGAGAAT 12180 

AGGACTTGAG ATACTCACGA ACTTTTGTGA GATTATCCAT GATATCTGTT AGTTTTTTTC 12240 

ATATGTAGCA TAATATATAA TAGGTGATCT GAGAATTATA GGGTCATTTG TCAAATTCTG 123 00 

ACTTAGCATA CTTGCCAATT GTCTTCTCTC TTAGGCCTTG CAGTGACATG ACTGCTCTTT 12 360 

TCGCATACTG CATCATATCA GGAGTCGGTT TTCTGTTTTG ATCTTCAAAC CATCCGAGCC 12420 

ATTCGACCAC ATCTCTGCCT TGTGGCGGTG CATTAGTCGT CAATCCTCCG GTACTATCAT 124 80 

CTGGAGTGTA TTTGTTATCT CCAACACAAA ACTGTTGTGC CAAGTCGGCA GAGGATCCTA 12 54 0 

CTGCATAAGC GTACAACAAA CCTGCTGTAG TAAGAGATGT ATACTCAATG TCATCAGGCT 12600. 

GivWiCATT CCTTGCTCTG GTGGATCTGA GCAGAAGAGC TGTCAATTGC CCCCAGAAGT 12660 

GGAAGGCAGG GTTTTTGACG GAAGAATATG GAGACTTAGA AG ACAATC C A AAGTCGATCA 12 720 

AATAAGG CAT GTATGAATCG GCCTTGTCAA TTTCTTGGCC TGGAAGCATC ATTTGGACCA 12780 

TTTCATCTGC AACTTCTCGG TTCAAGATCC AG GTCGTT AC ATCTTCTGTA GACATTCCGG 12 84 0 

TTATTTTGCA GAGGTGTCCA AATGTTGCCA ATGCAGCACA ATCTTTGAAT CTGGAAACAA 12 9O0 

TAGTTCCGTA TCTGAACGAG GCACATTCAT GTTTTTTGAA CATGTGGAAG AACATGTCCA 12 960 

CTGCAGCGAC AATTTTTGTG TAATTACTG T CATTTCCCCA CACATCAAAA ATGTCACGAC 13 020 

CTTCTGGCAC AAGAGGTTCA AACTGTTCAT TGATCATTTT GCATTGATTT GTCAGCCCAT 13080 

CCATGAGCTT TTTTCTGTAT TCAGG CATTT GTGTTCTGCC CACTCTGTAT AAGCCAAGTA 1314 0 

GATACAAAGG CAACCATTTG TCATCTGCGC TGGTTCTGGA AG CATC C GAT ACTCCATCTG 13 2 00 

GAAGTACGCC GTCCAGGGCT TTCAAGGATA CAAGGTCAAA TATTCCGATT GTATCCCCTG 132 60 

CTTTCCCGAT GTTTATTCCG AAACTTG AC C AATCTTTATC CAACTTACCC CGGATGTCCT 13320 

TTAATGCTCC ATACAAGTAG CTGTTGACAT GTATGATTGA TACATTTCCG GATTTGAGGC 13 3 80 

CTTGGTAGAC ATATCCTCTT AGATCTGACA AACTTTTTGT AGTATTGATG TAAAGAGGAA 1344 0 

TCTCCTTTGA TTTT CTGAAG TAATCTGCCG GGTATTCCAC TGGATCCTCA TTTGCAGGAA 13 500 

GTTTTGGAAC TATGACTGTG TTGTCAATGA TTCTCTTGAC TGTAACAGAC ATTTTGATTA 13 560 
CTGTTAAAGT TTCTCCTGAG CCTTTTAATG ATAATAATGG TTTGTTTGTC TTCGTCCCTA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
TTGTAATACG ACTCACTATA GGG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 
(Cj STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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TAGTGAGTCG TATTACAACT CGAGGGGGGG CCCGGTACCC AATTCGCCCT AT AG TGAGTC 13 680 
GTATTACAAT TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC 13740 
CCAACTTAAT CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC 13800 
CCGCACCGAT CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGGG ACGCGCCCTG 
TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 
CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 13980 
CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG 14 04 0 
GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG 
ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 
CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 
GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT 
TAACAAAATA TTAACG CTT A CAATTTAGGT G 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
TTGTAATACG ACTCACTATA GGG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



- 106 - 



BNSDOCID; <WO _9634625A1_L> 



f 



WO 96/34625 PCT/US96/06053 



> 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACGAAGACAA ACAAACCATT ATT AT CATT A AAAGGCTCAG GAGAAACTTT 50 
(2) INFORMATION FOR SEQ ID NO: 11: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT 60 
AGCATAACCC CTTGGGGCCT CTAAACGGG T CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC 120 
TATATCCGGA TCGAGA 136 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 
GGGTCGGCAT GGCATCTCCA CCTCCTCGCG GTCCGACCTG GGCATCCGAA GGAGGACGTC 60 
GTCCACTCGG ATGGCTAAGG GAG 83 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

TTGTAGAAGG TTGGTTCAGT AGTTGGAAAA GCTCTATTGC CTCTTTTTTC TTTATCATAG 60 

GGTTAATCAT TGGACTATTC TTGGTTCTCC GAGTTGGTAT CCATCTTTGC ATTAAATTAA 12 0 

AGCACACCAA GAAAAGACAG ATTTATACAG ACATAGAGAT GAACCGACTT GGAAAGTAAC 180 
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TCAAATCCTG CTAGCTATGA AAAAAACTAA CAGATATCCA ACCCGGGAGC TAGTTGCGGC 24 0 

CGCCTAGCAG ATTCTTCATG TTTGGACCAA ATCAACTTGT GATACCATGC TCAAAGAGGC 3 00 

CTCAATTATA TTTGAGTTTT TAATTTTTAT GAAAAAAACT AACAGCAATC ATGGAAGTCC 360 

ACGATTTTGA GACCGACGAG •TTCAATGATT TCAATGAAGA TGACTATGCC ACAAGAGAAT 42 0 

TCCTGAATCC CGATGAGCGC ATGACGTACT TGAATCATGC TGATTACAAT TTGAATTCTC 480 

CTCTAATTAG TGATGATATT GACAATTTGA TCAGGAAATT CAATTCTCTT CCGATTCCCT 540 

CGATGTGGGA TAGTAAGAAC TGGGATGGAG TTCTTGAGAT GTTAACATCA TGTCAAGCCA 6 00 

ATCCCATCTC AACATCTCAG ATGCATAAAT 630 

{2} INFORMATION . FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 0 base pairs 
<B) TYPE: nucleic acid 

(C) S TRANDEDNE SS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

ATTTATGCAT CTGAGATGTT GAGATGGGAT TGGCTTGACA TGATGTTAAC ATCT CAAGAA 60 
CTCCATCCCA GTTCTTACTA T1CCACATCG AGGGAA' *'.GG A^S^a^l"; ;4 \ATTTCCTGA ■ 120 

TCAAATTGTC AATATCATCA CTAATTAGAG GAGAATTCAA ATTGTAATCA GCATGATTCA 180 

AGTACGTCAT GCGCTCATCG GGATTCAGGA ATTCTCTTGT GGCATAGTCA TCTTCATTGA 240 

AATCATTGAA CTCGTCGGTC TCAAAATCGT GGACTTCCAT GATTGCTGTT AGTTTTTTTC 3 00 

ATAAAAATTA AAAACTCAAA TATAATTGAG GCCTCTTTGA GCATGGTATC ACAAGTTGAT 3 60 

TTGGTCCAAA CATGAAGAAT CTGCTAGGCG GCCGCAACTA GCTCCCGGGT TGGATATCTG 420 

TTAGTTTTTT TCATAGCTAG CAGGATTTGA GTTACTTTCC AAGTCGGTTC ATCTCTATGT 4 80 

CTGTATAAAT CTGTCTTTTC TTGGTGTGCT TTAATTTAAT GCAAAGATGG ATACCAACTC 540 

GGAGAACCAA GAATAGTCCA ATGATTAACC CTATGATAAA GAAAAAAGAG GCAATAGAGC 600 

TTTTCCAACT ACTGAACCAA CCTTCTACAA 630 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unkno wn 

(ii) MOLECULE TYPE: RNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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UCAGGAGAAA C 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY unknown 

(ii) MOLECULE TYPE: RNA 

(ix) FEATURE: 

(A) NAME / KEY : 5' Gppp 

(B) LOCATION: 1 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 16: 
AACAGUAAUC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) BkyuEMCE DESCRIPTION: SEQ ID NO: 17: 
GAUUACUGUU AAAGUUUCUC CUGA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : RNA 



(ix) FEATURE: 

(A) NAME /KEY : polyA 

(B) LOCATION: 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCUACAUAUG 

(2) INFORMATION /FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 
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(ix) FEATURE: 

(A) NAME /KEY : 5' Gppp 

(B) LOCATION: 1 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 



AACAGAUAUC 



10 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

■ (C) STRANDEDNESS : single 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GAUAUCUGUU AGUUUUUUUC AUAUGUAGC 29 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid , 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : RNA 

(ix) FEATURE: 

(A) NAME / KEY : polyA 

(B) LOCATION: 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GUAGACUAUG 10 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: RNA 

(ix) FEATURE: 

(A) NAME /KEY : 5' Gppp 

(B) LOCATION: 1 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
AACAGAUAUC 10 
(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GAUAUCUGUU ACUUUUUUUC AUAGUCUAC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(ix) FEATURE: 

(A) NAME/ KEY: polyA 

(B) LOCATION: 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
UAUCCCUAUG 1Q 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

(ix) FEATURE : 

(A) NAME /KEY: 5' Gppp 

(B) LOCATION: 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACAGAGAUC Q 
(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : RNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GAUCUCUGUU AGUUUUUUUC AUAGGGAUA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(ix) FEATURE: 

(A) NAME /KEY : polyA 

(B) LOCATION: 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AAUUUUUAUG 10 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single- 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 

(ix) FEATURE : 

(A) NAME /KEY: 5' Gppp 

(B) LOCATION: 1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
AACAGCAAUC ±Q 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 29: 
GAUUGCUGUU AGUUUUUUUC AUAAAAAUU 2 ° 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: RNA - 

(ix) FEATURE: 

(A) NAME /KEY : polyA 

(B) LOCATION: 10 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
UUUAAGUAUG 

10 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGGAUCAAAG UUUUUUUCAU ACUUAAA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: ^7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:' 
CATTCAAGAC GCTGCTTCGC AACTTCC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CATGAATGTT AACATCTCAA GA 

22 

(2) INFORMATION FOR SEQ ID NO: 34: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : RNA 

(ix) FEATURE: ■ 

(A) NAME /KEY : miscellaneous feature 

(B) LOCATION: 11 . . 12 

(D) OTHER INFORMATION: Intergenic dinucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GAUNNCUGUU ANUUUUUUUC AUA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
UAUGAAAAAA A 

11 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
UUUUUUUCAU A 

11 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
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TATGAAAAAA A 

11 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 5 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
CCGGCTCGAG TTGTAATACG ACTCACTATA GGGACGAAGA CAAACAAACC ATTATTATC 59 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GAACTCTCCT CTAGATGAGA AC 

22 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
AGGTCGGACC GCGAGGAGGT GGAGATGCCA TGCCGACCCA CGAAGAC CAC AAAACCAG 58 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
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ATGTTGAAGA GTGACCTACA CG 
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WHAT IS CLAIMED IS 



1. A recombinant replicable vesiculovirus, the 
genome of which comprises a foreign RNA sequence. 

5 

2. The vesiculovirus of claim 1 in which an RNA 
sequence complementary to said foreign RNA sequence encodes a 
peptide or protein that is expressed in a suitable host 
infected by the virus. 

10 

3. The vesiculovirus of claim l in which the 
peptide or protein is antigenic or immunogenic. 

4. The vesiculovirus of claim 2 in which the 
15 peptide or protein displays the antigenicity or 

immunogenicity of an antigen of a pathogenic microorganism. 

5. The vesiculovirus of claim 4 in .which the 
pathogenic microorganism is a virus. 

20 

6. The vesiculovirus of claim 4 in which the 
pathogenic microorganism is a bacterium. 

7. The vesiculovirus of claim 4 in which the 
25 pathogenic microorganism is a parasite. 

8. The vesiculovirus of claim 4 in which the 
pathogenic microorganism is a human pathogen. 

30 9 - The vesiculovirus of claim 4 in which the 

pathogenic microorganism is a non-human pathogen. 

10. The vesiculovirus of claim 2 in which the 
peptide or protein displays the antigenicity or 
35 immunogenicity of a tumor specific or tumor associated 
antigen. 
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11. Plasmid pVSVFL(+) , as deposited with the ATCC 
and assigned accession number 97134. 

12. A nucleic acid comprising the DNA sequence of 
5 plasmid pVSVFL(+) as depicted in Figure 1 from nucleotide 

numbers 623-12088 (a portion of SEQ ID NO:l), in which a 
region nonessential for vesiculovirus replication has been 
inserted into or replaced by foreign DNA. 

10 13. A host cell containing the recombinant 

vesiculovirus of claim 1. 

14 . A method of producing a recombinant 
vesiculovirus comprising culturing a cell containing (a) a 

15 recombinant nucleic acid that can be transcribed to produce 
an RNA molecule comprising a vesiculovirus antigenomic ( + ) 
RNA, in which a region of the RNA nonessential for 
replication of the vesiculovirus has been inserted into or 
replaced by a forei^iv TvNa sequence; (b) a second recombinant 

20 nucleic acid encoding a vesiculovirus N protein; (c) a third 
recombinant nucleic acid encoding a vesiculovirus L protein; 
and (d) a fourth recombinant nucleic acid encoding a 
vesiculovirus P protein; whereby the first recombinant 
nucleic acid is transcribed in the cell to produce said RNA 

25 molecule, and the N, L and P proteins are expressed in the 
cell, and a recombinant replicable vesiculovirus is produced 
that has a genome that is the complement of said antigenomic 
RNA comprising said foreign RNA sequence. 

3 0 15. The method according to claim 14 in which the 

cell is a mammalian cell. 

16. The method according to claim 14 in which the 
first recombinant nucleic acid comprises a first promoter 
35 sequence which controls the expression of the RNA molecule in 
the cell; the second recombinant nucleic acid comprises a 
second promoter sequence which controls the expression of the 
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N protein in the cell; the third recombinant nucleic acid 
comprises a third promoter sequence which controls the 
expression of the P protein in the cell; and the fourth 
recombinant nucleic acid comprises a fourth promoter sequence 
5 which controls the expression of the L protein in the cell. 

17. The method according to claim 14 in which said 
RNA molecule further comprises a ribozyme sequence 
immediately downstream of said antigenomic ( + ) RNA, that 

10 cleaves at the 3' terminus of the antigenomic (+) RNA. 

18. The method according to claim 14 in which the 
first recombinant nucleic acid is a DNA plasmid vector 
comprising the following operatively linked components: 

15 (a) a promoter; 

(b) a DNA sequence that can be transcribed 
under the control of the promoter in the 
cell to produce said RNA molecule;, and 

(c) a transcription termination signal. 

20 

19. The method according to claim 14 or 18 in 
which the second recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components : 

25 (a) a promoter which controls the expression 

of the N protein; 

(b) a translation initiation signal ; 

(c) a DNA sequence encoding the N protein; 
and 

30 (d) a transcription termination signal; and 

in which the third recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components : 

(a) a promoter which controls the expression 
35 of the P protein; 

(b) a translation initiation signal; 
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(c) a DNA sequence encoding the P protein; 
and 

(d) a transcription termination signal; and 
and in which the fourth recombinant nucleic acid is a DNA 

5 plasmid vector comprising the following operatively linked 
components : 

(a) a promoter which controls the expression 
of the L protein; 

(b) a translation initiation signal; 

10 ( c ) a DNA sequence encoding the L protein; 

and 

(d) a transcription termination signal. 

20. The method according to claim 17 in which the 
15 first recombinant nucleic acid is a DNA plasmid vector 

comprising the following operatively linked components: 

(a) a promoter; 

(b) a DNA sequence that can be transcribed 
under th* control of the promoter in the 

20 cell to produce said RNA molecule; and 

(c) a transcription termination signal. 

21. The method according to claim 19 in which the 
first recombinant nucleic acid, the second recombinant 

25 nucleic acid, the third recombinant nucleic acid, and the 
fourth recombinant nucleic acid, each further comprises a 
selectable marker. 

22. The method according to claim 14 in which the 
30 second, third and fourth recombinant nucleic acids form part 

of a single recombinant nucleic acid. 

23. The method according to claim 16 in which the 
first, second, third and fourth promoter sequences are RNA 

35 polymerase promoter sequences for the same RNA polymerase, 
and in which the cell also contains a cytoplasmic source of 
said RNA polymerase. 
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24. The method according to claim 23 in which the 
cytoplasmic source of said RNA polymerase is a recombinant 
vaccinia virus expressing said RNA polymerase in the cell. 

5 25. A method of producing a recombinant 

vesiculovirus comprising culturing a mammalian cell 
containing: 

(a) a first DNA plasmid vector comprising the 
following operatively linked components: 

10 (i) a bacteriophage RNA polymerase promoter; 

(ii) a first DNA sequence that is transcribed 
in the cell to produce an RNA molecule 
comprising (A) a vesiculovirus 
antigenomic ( + ) RNA in which a region of 

15 the RNA nonessential for replication of 

' the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence, 
and (B) a ribozyme sequence immediately 
downstream of said antigenomic (+) 7s2V+, 

2 0 that cleaves at the 3' terminus of the 

antigenomic RNA; and 
(iii) a transcription termination signal for 
the RNA polymerase; 

(b) a second DNA plasmid vector comprising the 
25 following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter ; 

(ii) a second DNA sequence encoding an N 
protein of the vesiculovirus; and 

3 0 (iii) a second transcription termination signal 

for the RNA polymerase; 

(c) a third DNA plasmid vector comprising the 
following operatively linked components: 

(i) the , bacteriophage RNA polymerase 
35 promoter; 

(ii) a third DNA sequence encoding a P protein 
of the vesiculovirus; and 
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(iii) a third transcription termination signal 
for the RNA polymerase; 

(d) a fourth DNA plasmid vector comprising the 
following operatively linked components: 

5 (i) the bacteriophage RNA polymerase 

promoter ; 

(ii) a fourth DNA sequence encoding an L 
protein of the vesiculovirus; and 
(iii) a fourth transcription termination signal 
10 for the RNA polymerase; and 

(e) a recombinant vaccinia virus comprising a 
sequence encoding the bacteriophage RNA polymerase; whereby 
in said cell the first DNA sequence is transcribed to produce 
said RNA molecule , the N, P, and L proteins and the 

15 bacteriophage RNA polymerase are expressed, and a recombinant 
replicable vesiculovirus is produced that has a genome that 
is the complement of said antigenomic RNA comprising said 
foreign RNA sequence. 

20 26. The method according to claim 25 in which the 

ribozyme sequence is the sequence of the hepatitis delta 
virus ribozyme, and the bacteriophage RNA polymerase is the 
T7 RNA polymerase. 

25 27. The method according to claim 14 in which the 

foreign RNA sequence is the complement of a sequence encoding 
a protein or peptide that is immunogenic or antigenic. 

28. The method according to claim 25 in which the 
30 foreign RNA sequence is the complement of a sequence encoding 

a protein or peptide that is immunogenic or antigenic. 

29. The method according to claim 27 or 28 in 
which the protein or peptide displays the antigenicity or 

35 immunogenicity of an antigen of a pathogenic microorganism. 
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30. The method according to claim 27 or 2 8 in 
which the protein or peptide displays the antigenicity or 
immunogenicity of a tumor specific antigen* 

5 31. The method according to claim 25 which further 

comprises before step (a) the step of introducing said first, 
second, third and fourth plasmid vectors and said recombinant 
vaccinia virus into said cell. 

10 32. A host cell comprising (a) a recombinant 

nucleic acid that can be transcribed to produce an RNA 
molecule comprising a vesiculovirus antigenomic (+) RNA, in 
which a region of the RNA nonessential for replication of the 
vesiculovirus has been inserted into or replaced by a foreign 

15 RNA sequence; (b) a second recombinant nucleic acid encoding 
a vesiculovirus N protein; (c) a third recombinant nucleic 
acid encoding a vesiculovirus L protein; and (d) a fourth 
recombinant nucleic acid encoding a vesiculovirus P protein. 

20 33. A host cell comprising: 

(a) a first DNA plasmid vector comprising the 
following operatively linked components: 

(i) a bacteriophage RNA polymerase promoter; 
(ii) a first DNA sequence that is transcribed 
25 in the cell to produce an RNA molecule 

comprising (A) a vesiculovirus 
antigenomic ( + ) RNA in which a region of 
the RNA nonessential for replication of 
the vesiculovirus has been inserted into 
3 0 or replaced by a foreign RNA sequence, 

and (B) a ribozyme sequence immediately 
downstream of said antigenomic ( + ) RNA, 
that cleaves at the 3' terminus of the 
antigenomic RNA; and 
35 (iii) a transcription termination signal for 

the RNA polymerase; 
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(b) a second DNA plasmid vector comprising the 
following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter; 

5 (ii) a second DNA sequence encoding an N 

protein of the vesiculovirus; and 
(iii) a second transcription termination signal 
for the RNA polymerase; 

(c) a third DNA plasmid vector comprising the 
10 following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter; 

(ii) a third DNA sequence encoding a P protein 
of the vesiculovirus; and 
15 (iii) a third transcription termination signal 

for the RNA polymerase; 

(d) a fourth DNA plasmid vector comprising the 
following operatively linked components: 

(i) the bac^iicph^gci RNA polymerase 
20 promoter; 

(ii) a fourth DNA sequence encoding an L 
protein of the vesiculovirus; and 
(iii) a fourth transcription termination signal 
for the RNA polymerase; and 
25 (e) a recombinant vaccinia virus comprising a 

sequence encoding the bacteriophage RNA polymerase; whereby 
in said cell the first DNA sequence is transcribed to produce 
said RNA molecule, the N, P, and L proteins and the 
bacteriophage RNA polymerase are expressed, and a recombinant 
30 replicable vesiculovirus is produced that has a genome that 
is the complement of said antigenomic RNA comprising said 
foreign RNA sequence. 

« 

34. A vaccine formulation comprising an effective 
35 immunizing amount of the vesiculovirus of claim 3; and a 
pharmaceutical^ acceptable carrier. 
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35. A vaccine formulation comprising an effective 
immunizing amount of the recombinant vesiculovirus of claim 
4; and a pharmaceutical^ acceptable carrier. 

5 3 6. A vaccine formulation comprising an effective 

immunizing amount of the recombinant vesiculovirus of claim 
10; and a pharmaceutical^ acceptable carrier. 

37. A vaccine formulation comprising an effective 
10 immunizing amount of a first recombinant vesiculovirus of 

claim 3; an effective immunizing amount of a second 
recombinant vesiculovirus of claim 3; and a pharmaceutically 
acceptable carrier; in which said peptide or protein 
expressed by the first and second recombinant vesiculoviruses 
15 are different. 

38. The vaccine formulation of claim 37 in which 
the peptide or protein expressed by the first recombinant 
vesiculovirus displays Live antigenicity or immunogenic ity of 

20 an antigen of a first pathogenic microorganism, and the 
peptide or protein expressed by the second recombinant 
vesiculovirus displays the antigenicity or immunogenicity of 
an antigen of a second, different pathogenic microorganism. 

25 39. A method of treating or preventing a disease 

or disorder in a subject comprising administering to the 
subject an effective immunizing amount of the recombinant 
vesiculovirus of claim 3 . 

30 4 0. A method of treating or preventing a disease 

or disorder in a subject caused by infection by a pathogenic 
microorganism comprising administering to the subject an 
effective immunizing amount of the recombinant vesiculovirus 
of claim 4. 

35 

41. A method of treating or preventing a subject 
having a tumor comprising administering to the subject an 
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effective immunizing amount of the recombinant vesiculovirus 
of claim 10* 

42. The method according to claim 39 in which the 
5 subject is a human. 

43. The method according to claim 39 in which the 
subject is a non-human animal. 

10 44. A kit comprising in one or more containers (a) 

a first recombinant nucleic acid that can be transcribed in a 
host cell to produce an RNA molecule comprising a 
vesiculovirus antigenomic ( + ) RNA, in which a region of the 
RNA nonessential for replication of the vesiculovirus has 

15 been inserted into or replaced by a foreign RNA sequence; (b) 
a second recombinant nucleic acid encoding and capable of 
expressing in the host cell a vesiculovirus N protein; (c) a 
third recombinant nucleic acid encoding and capable of 
fcxpiesaing in the host cell a vaslr^lov^rus L protein; and 

2 0 (d) a fourth recombinant nucleic acid encoding and capable of 
expressing in the host cell a vesiculovirus P protein. 

45. A kit comprising in one or more containers (a) 
a first recombinant nucleic acid that can be transcribed in a 

25 host cell to produce an RNA molecule comprising a 

vesiculovirus antigenomic (+) RNA; (b) a second recombinant 
nucleic acid encoding and capable of expressing in the host 
cell a vesiculovirus N protein; (c) a third recombinant 
nucleic acid encoding and capable of expressing in the host 

30 cell a vesiculovirus L protein; and (d) a fourth recombinant 
nucleic acid encoding and capable of expressing in the host 
cell a vesiculovirus P protein. 

46. A kit comprising in one or more containers (a) 
35 a recombinant nucleic acid that can be transcribed to produce 

an RNA molecule comprising a vesiculovirus antigenomic ( + ) 
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RNA; and (b) a host cell that recombinantly expresses 
vesiculovirus N, P, and L proteins. 

47. The kit of claim 45 in which said first 
5 recombinant nucleic acid further comprises a polylinker in a 
region that is transcribed to produce said RNA molecule, said 
region being nonessential for replication of the 
vesiculovirus. 

10 48 • The according to claim 45 in which the 

second recombinant nucleic acid is a DNA plasmid vector 
comprising the following operatively linked components: 

(a) a promoter which controls the expression 
of the N protein; 
15 ( b ) a translation initiation signal; 

(c) a DNA sequence encoding the N protein; 
and 

(d) a transcription termination signal; and 
in which the third recombinant nuclei e acid is a DNA plasmid 

2 0 vector comprising the following operatively linked 

components: 

(a) a promoter which controls the expression 
of the P protein; 

(b) a translation initiation signal; 

25 < c ) a DN A sequence encoding the P protein; 

and 

(d) a transcription termination signal 
and in which the fourth recombinant nucleic acid is a DNA 
plasmid vector comprising the following operatively linked 

3 0 components: 

(a) a promoter which controls the expression 
of the L protein; 

(b) a translation initiation signal; 

(c) a DNA sequence encoding the L protein; 
35 and 

(d) a transcription termination signal; and 
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in which the first recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components : 

(a) a promoter; 
5 (b) a DNA sequence that can be transcribed 

under the control of the promoter in the 
cell to produce said RNA molecule, said 
RNA molecule further comprising a 
ribozyme sequence immediately downstream 
10 of said antigenomic (+) RNA, that cleaves 

at the 3' terminus of the antigenomic (+) 
RNA, said DNA comprising a polylinker in 
a region that is transcribed to produce 
said RNA molecule, said region being 
15 nonessential for replication of the 

vesiculovirus; and 
(c) a transcription termination signal. 

^9. The kit of claim 4S in v:hich the promoter is 
20 an RNA polymerase promoter, and in which the cell also 
contains a cytoplasmic source of said RNA polymerase. 

50. A kit comprising 

(a) in a first container a first DNA plasmid 
25 vector comprising the following operatively linked 
components : 

(i) a bacteriophage RNA polymerase promoter; 
(ii) a first DNA sequence that can be 

transcribed in a cell to produce an RNA 

30 ■ molecule comprising (A) a vesiculovirus 

antigenomic (+) RNA in which a region of 
the RNA nonessential for replication of 
the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence, 

35 and (B) a ribozyme sequence immediately 

downstream of said antigenomic ( + ) RNA, 
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that cleaves at the 3' terminus of the 
antigenomic RNA; and 
(iii) a transcription termination signal for 
the RNA polymerase; 
5 (b) in a second container a second DNA plasmid 

vector comprising the following operatively linked 
components : 

(i) the bacteriophage RNA polymerase 
promoter; 

10 (ii) a second DNA sequence encoding an N 

protein of the vesiculovirus; and 
(iii) a second transcription termination signal 
for the RNA polymerase; 

(c) in a third container a third DNA plasmid 
15 vector comprising the following operatively linked 

components : 

(i) the bacteriophage RNA polymerase 
promoter; 

(ii) a third DNA sequence encoding a P protein 
20 of the vesiculovirus; and 

(iii) a third transcription termination signal 
for the RNA polymerase; 

(d) in a fourth container a fourth DNA plasmid 
vector comprising the following operatively linked 

25 components: 

(i) the bacteriophage RNA polymerase 
promoter; 

(ii) a fourth DNA sequence encoding an L 
protein of the vesiculovirus; and 
30 a f^rth transcription termination signal 

for the RNA polymerase; and 

(e) in a fifth container a recombinant vaccinia 
virus comprising a sequence encoding the bacteriophage RNA 
polymerase. 

51. A kit comprising in a container an effective 
immunizing amount of the vesiculovirus of claim 3. 
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52. The vesiculovirus of claim 1 which is a 
vesicular stomatitis virus. 

53. The vesiculovirus of claim 4 in which the 
5 peptide or protein displays the antigenicity or 

immunogenicity of an envelope glycoprotein of a virus other 
than a vesiculovirus. 

54. The vesiculovirus of claim 53 in which the 

10 envelope glycoprotein is an envelope glycoprotein of a Human 
Immunodeficiency Virus. 

55. The vesiculovirus of claim 53 in which the 
peptide or protein is incorporated into the vesiculovirus 

15 envelope. ■ * 

56. The vesiculovirus of claim 53 in which the 
peptide or protein is expressed as a fusion protein 
cuiupiioing tn e cytoplasmic domain of c. vccirjilovirus G 

2 0 protein. 

57. The vesiculovirus of claim 56 in which the 
native G protein of the vesiculovirus is not expressed. 

25 58 • The vesiculovirus of claim 56 in which the 

native G protein of the vesiculovirus is also expressed. 

59. The vesiculovirus of claim 3 in which a second 
RNA sequence complementary to said foreign RNA sequence 
30 encodes a second peptide or protein that is expressed in the 
suitable host, in which the first peptide or protein and the 
second peptide or protein display different antigenicity or 
immunogenicity . 

35 60 • An inactivated recombinant vesiculovirus that 

is the product of a method comprising inactivating a 
recombinant replicable vesiculovirus, the genome of which 
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comprises a foreign RNA sequence, in which an RNA sequence 
complementary to said foreign RNA sequence encodes a peptide 
or protein that is expressed in a suitable host infected by 
the recombinant replicable vesiculovirus, and in which the 
5 peptide or protein is antigenic or immunogenic. 

61. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 
immunogenicity of an antigen of a pathogenic microorganism. 

10 

62. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 
immunogenicity of a tumor specific or tumor associated 
antigen. 

15 

63. The vesiculovirus of claim 60 in which a 
second RNA sequence complementary to said foreign RNA 
sequence encodes a second peptide or protein that is 
expressed in the suitable host, in which the first peptide or 

20 protein and the second peptide or protein display different 
antigenicity or immunogenicity. 

64. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 

25 immunogenicity of an envelope glycoprotein of a virus other 
than a vesiculovirus. 

65. The vesiculovirus of claim 64 in which the 
envelope glycoprotein is an envelope glycoprotein of. a Human 

30 Immunodeficiency Virus. 

66. The vesiculovirus of claim 64 in which the 
peptide or protein is incorporated into the vesiculovirus 
envelope. 

35 

67. The vesiculovirus of claim 64 in which the 
peptide or protein is expressed as a fusion protein 
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comprising the cytoplasmic domain of a vesiculovirus G 
protein. 

68. A vaccine formulation comprising an effective 
5 immunizing amount of the vesiculovirus of claim 60; and a 

pharmaceutical ly acceptable carrier. 

69. A vaccine formulation comprising an effective 
immunizing amount of the vesiculovirus of claim 64; and a 

10 pharmaceutical^ acceptable carrier. 

70. A vaccine formulation comprising an effective 
immunizing amount of the vesiculovirus of claim 66; and a 
pharmaceutical ly acceptable carrier. 

15 

71. A method of treating or preventing a disease 
or disorder in a subject comprising administering to the 
subject an effective immunizing amount of the inactivated 
recombinant vesiculovirus of claim 60. 

20 

72. A kit comprising in a container an effective 
immunizing amount of the inactivated recombinant 
vesiculovirus of claim 60. 

25 73. A method for producing a peptide or protein 

comprising culturing a cell containing the recombinant 
replicable vesiculovirus of claim 2 in said suitable host; 
and recovering the expressed peptide or protein. 

30 74. A method for producing an inactivated virus 

vaccine comprising inactivating a recombinant replicable 
vesiculovirus, the genome of which comprises a foreign RNA 
sequence, in which an RNA sequence complementary to said 
foreign RNA sequence encodes a peptide or protein that is 

35 expressed in a suitable host infected by the recombinant 

replicable vesiculovirus, and in which the peptide or protein 
is antigenic or immunogenic. 

- 132 - 



BNSDOCID: <WO. 



9634625A1_1_> 



c 



WO 96/34625 



PCTYUS96/06053 



Internationa] Application No: PCT/ 



MICROORGANISMS 

Optional Sheet in connection with the microorganism referred to on page 21, lines 15-35 of the description ' 



A. IDENTIFICATION OF DEPOSIT * 

Further deposits are identified on an additional sheet 1 



Name of depositary institution ' 
American Type Culture Collection 



Address of depositary institution (including postal code and country) ' 

12301 Parklawn Drive 
Rockville. MD 20852 
US 



Date of deposit ' May 2, 1995 Accession Number * 97134 



B. ADDITIONAL INDICATIONS ' (leave blank if no? applicable). This information is continued on a separate attached sheet 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' 



tDdjtauotB IRinill dotpaud boa) 



D. SEPARATE FURNISHING OF INDICATIONS ' (leave blank if not applicable) ' 



The indications bsted below will be submitted to the International Bureau later • (Specify the general nature of the indications e.g., 
'Accession Number of Deposit*) 



E. B^This sheet was received with the International application when filed (to be checked by the receiving Office) 




(Authorized Officer) 



□ The date of receipt (from the applicant) by the International Bureau 



was 



(Authorized Officer) 



Form PCT/RO/134 (January T981) 



- 133 - 



BNSDOCiD: <WO. 



9634625A1_I_> 



WO 96/34625 



PCT/US96/06053 



CACCTAAATrGTAAG CGTTAAT ATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGCTCATTTTTTAACCAA7AGGCC 
GTGGATTIAACATTCGCAATTATAAAACAATTTTAACCGCAATTTAAAAACAATTTAGTCGAGTAAAAAATTGGTTATCCCGCTTTAGCC 



GAAATCGG 

<■ 90 



CAAAATCCCT7ATAAATCAAAAGAATAGACCGAGATAGGGT7GAGTGT7GTTCCAGTTTGGAACAAGAGTCCACT ATT AAAGAACGTGGA 
G7777AGGGAA7A777AGT7T7CT7ATCTGGC7CTA7CCCAAC7CACAACAAGG7CAAACC77GTTCTCAGG7GA7AA7TTC77GCACC7 W 



BsaA I 



C7CCAACC7CAAAGGG CGAAAAACCC7C7ATGAGGGCGA7GGCCCAC7ACCTGAACCA7CACCC7AA7CAAG7T77T7GCGG7C GAGG7G 
CAGG77GCAGTT7CCCGC77T77GGCAGATAGTCCCGC7ACCGGG7GA7GCACTTGG7AG7GGGATTAGUCAAAAAACCCCAGC 



+ 270 



7CCAC 



CCG7AAAGCAC7AA ATCCGAACCC7AAAGGGA6CCGCCGA777AGAGC7TGACGGGGAAAGCCGGCGAACG7GGCGAGAAA GG 
CCCAH7CG7GA777AGCC77GGGATTTCCC7CGGGGGC7AAA7CTCGAACTGCCCC777CGCCCGC77GCACCGC7CTITCC77CCC7T 350 



AAGGGAA 



GAAACCGAAAGG AGCGGGCGCTAGGGCGCTGGCAAGTG T AGCGGTCACGCTGCGCGT AACC ACCACACCCGCCGCGCTTAATG CGCCGCT 

ctttcgctttcctcgcccgcgatcccgcgaccgttcacatcgccagtgcgacgcgcattggtggtgtgggcggcgcgaaItacgcggcga 



450 



acagggcgcgtcccattcgccattcaggctgcgcaactgttgggaagggcgatcggt gcgggcctcttcgctatt 
tgtcccgcgcagggt aagcggtaagtccgacgcgttljacaacccTt^TcgTta^ 



acgccagctggcgaa 

1 H 1- 

gcggtcgaccgctt 



540 



agggggatgtcctgc aaggcgattaagttgggtaacgccaggcttttcccagtcacgacgttgtaaaa cgaccgccagtgaatt 
tccccctacacgacgttccgctaattcaacccattgcggtcccaaaagggtcagtgctgcaacatttt 



CTAATA 



GCTGCCGGTCACT7AACATTA7 

E= 

Apal Xhol 

CGACTCACTATACCGCGAATTCCCTACCGCCCCCCCCCTCCACTTCTAAT^r^rTr.rT.T.rr^r. CAAACCATU1 U 



630 



GCTGAGTGATATCCCGCTTAACCCATGGCCCGGGGGGGAGCTCAACATTATGClGAGTGATATCCCrCCUCTClTTCnT 



CCI AAT AA7 



720 



77 P« 



T7 P 



leader 



TCAT7AAAAGGCTCA GGAGAAACT7TAACAG7AATCAAAA7G7CtG77ACAG7CAAGAGAA7CA77GACAACACAGTCA7AGT7 CCAAAA 

AC7AA7777CCGAGTCC7CT77GAAA7TG7CAT7AGT7T7ACAGACAA7G7CAGT7C7C77AG7AAC7GT7G7C7CAG7A7CAAGG77T7 8 '° 
leader 



5 



r 



N 



Met Sec Vol 7hr Vol Lys Ang He U e Asp Asn Thr Vol He Vol Pro Lys 
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Bam H J 



CTTCCTGCAAATGACGATCCAGTGGAATACCCGGCAGATTACTTCAGAAAATCAAAGGAGATTCCTCTTTACATCAATACTACA AAAAGT 
GAAGGACGTTTACTCC T AGGTCACC T7 A7GGGCCGTCT AATGAAGTCTT7TAGTTTCCTCTAAGGAGAAATGTAGTTATGATCT 7*7 TCA 



N 



Leu Pro f,o Asn Glu Asp Pro Vol Glu Tyr Pro A,o A 5D 7 y r Pne Arg L ys Ser Lys Cki Me Pro Leu 7yr | le Asn Thr Tnr Ly s Ser 

nGTCACA-CV.AGACGA7ATG7CTACCAAGGCCKAAATCCCGAAA7G7ATCAA7CA7ACA7C7CAACACCTAC7TC7A^ GAGC/. * ' A 
AACAC^C:ACA:TC T CC7/ i TACACATCGTKCGCAG;i7AGGCCr7TACA7AG77AG7A^GTACAGI7C7CCATGAACA7ACC:CC-A/. : "° 



N 



Leu Ser Aso Leu Arg Sly Tyr Vol Tyr Gin G.y Leu Lys Ser Gly Asn Vol Ser ]| e He H, S Vol Asn Ser T y r Leu Tyr G y A o Leu 
AAGGACA7CCCCGGTAAGTTGGA7AAAGATTGGTCAACTTTCGGAA7AAACATCGGCAAAGCAGGGGATACAATCCGAA7ATTT CACCT- 

ttcctgtaggcccca77caacctatt7ctaaccagttcaaagccttatttgtagccctttcgtcccctatgttagccttataaac7ggaa 1080 



N 



Lys Asp ile Arg Gly Lys Leu Asp Lys Asp Trp Ser Ser Phe Gly lie Asn He Gly Lys AJo G.y Aso Thr He Gly He Phe Asp Leu 

GTATCCT7GAAAGCCC 7GGACGGCGTACTTCCAGATGGAGTATCGGATGCTTCCAGAACCAGCGCAGATGACAAATGGTTG CCT7TGTAT 
CA7AGGAACTT7CGGGACC TGCCGC ATGAAGGTCT ACCTC ATAGCCTACG AAGGTCTTGGTCGCGTC7AC TGTTT ACCAAC 



GGAAACA7 A 



1 170 



N 



Vol Ser Leu Lys Ala Leu Asp Gly Vol Leu Pro Asp Gly Vol Ser Asp A!o Ser Arg Thr Ser Alo Asp Asp Lys Trp Leu Pro Leu Tyr 

"' *' * ■ » , 
-. , \ 

CTACTTGGCTTATACAGAGTGGGCAGAACACAAATGCCTGAATACAGAAAAAAGCTCATGGATGGGCTGA^ 

gatgaaccgaatatgtctcacccgtcttgtgtttacggacttatgtcttItttcgagtacctacccgacIgtttagttacgtttt 



+ 1260 



ACTAG 



N 



Met lie 



Leu Leu Gly Leu Tyr Arg Vol Gly Arg Thr Gin Met Pro Glu Tyr Arg Lys Lys Leu Met Asp Gly Leu Thr Asn Gin Cys Lys 

AATGAACAGT77GAACCTCTTGTGCCACAAGGTCGTGACATTTTTGATGTGTGGGGAAATGACAGTAATTACACAAAAATTG7CGCTGCA 
T7ACTTGTCAAACT7GGAGAACACGG7CTTCCaGCAC7g{aAAAACTaCACACCCCT7TAC7G7CA7TAATG7G7T777a A CaCCGACC7 



1350 



N 



Asn Glu Cn Pne Glu Pro Leu Vol Pro Glu G.y Arg Aso He Phe Asp Vol Trp Gly Asn Asp Ser Asn Tyr Thr Lys He Vol Aio A«o 

CTGGACATCTTC7TCCACATGTTCAAAAAACATGAATGTGCCTCGTTCAGATACGGAACTATTGT7TCCAGA77C AAAGA7TGTGCTGCA 
CACCTCTACAAGAAGGTGTACAAGTTTTTTGTACTTACACGGAGCAAGTCTATGCCTTGATAACAAAGGTCTAAGTTTCTAACACGACci 



Vol Asp Met Phe Pne H.s Met Phe Lys Lys H>s Glu Cys Alo Ser Pne Arg Tyr Gly Thr lie Vol Ser Arg Phe Lys Asp Cys A.o A.o 
TTGGCAACATTTGGACACCTC7GCAAAATAACCGGAATCTCTACAGAAGATGTAACGACCTGGATC7TGAACCGAGAACTTG CAGATCAA 

aaccg77G7aaacctgtggagacgttttattggcct7acagatgtct7ctacattgctggacc7agaact7ggctcttcaacctctactJ 1530 



N 



Leu Alo Thr Pne Gly H« Leu Cys Lys lie Thr Gly Mel Ser Thr Glu Asp Vol Thr Thr Trp lie Leu Asn Arg Glu Vol Alo Asp Cu 
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ATGGTCCAAATGATGCTTCCAGGCCAAGAAATTGA CAAGGCCGATTCATACATGCCTTATTTGATCGA Cf TTRKATTf^TrTTrT a apti-t 

taccaggtttactacgaaggtccggttctttaactgttcccgctaagtatgtacggaataaactagctgaaacct aacagaagattcaga ,62 ° 



N 



Me, Vo, On Me, Met Le 0 Pro G« y G,n Glu He Asp Lys Ato Asp Ser V Me, Pro V Leu He Asp Phe By Leu Ser Ser Lys Ser 
CCATATTCTTCCGTCAAAAACCCTGCCTTCCACTTCTGG GGGCAATTGACAGCTCTTCTCCTCAGATCCA CCA.A.rAA.. 

ggtataagaaggcagtttttgggacggaaggtgaagacccccgttaactgtcgagaagacgagtctagg'tggt ctcgttccttacgggct ' 7 '° 



N 



Pro Tyr Ser Ser Vo, Lys Asn Pro A,o Phe H,s Phe Irp CIy Gln Leu 7hr a1o Leu Uu Leu ^ Sfir ^ ^ ^ ^ ^ ^ ^ 

| Bsm071 .Bam HI 

CAGCCTGATGACATTGAGIATACATCTCTTACTACA GCAGGTTTGTTGTACGCTTATGCAGTAGGATCCTC TGCCGACTTGGCACAACAn 
GTCGGACTACTGTAACTCATATGTAGAGAATGATGTCGTCCAAACAACATGCGAATACGTCATCCrAGGAGACGGCTGAACCGTCTlGTC "°° 



N 



Bn Pro asp Asp He G,u T y r Thr Ser Leu Thr TMr A to G , y Leu Leu Tyr A,o , y r Wo Vo, Gly Ser Ser A,o Asp Leu A,o On Gln 

TTTTGTGTTGGAGATAACAAATACACTCCAGATGA7AG TACCGGAGGATTGACGACTAATGCACCGCCACA AGGrAf:Af:A Tnrr 

AAAACACAACCTCTATTGTT7ATGTGAGGTCTACTATCATGGCC7CCTAACTGCTCATTACGTGGCGGTGTTC CGTCTCT ACACCAGCTT ' 890 



N 



Phe Cy s vo, G. V Asp Asn L y s T y r Tnr Pro Asp Asp Ser Tar Gly G, y Leu Thr Thr Asn A,o Pro Pro G,n G, y Arg Asp Vo, Vo, G,u 
TGGCTCGGArGGTTTGAAGATCAAAACAGAAAACCGACTCCTGATATGATGCAGTATGCGAAAAGAGCAGTr.ATr.TrArr.r^r 



I960 



N 



Trp Leu G, y Trp Pne Ghj Asp Gln Asn Ar 9 L y s Pro Th r Pro Asp Me, Me, Gin Tyr A,o Lys.Arg A,o Vo, Met Ser Leu Gln G, y Leu 

AGAGAGA AGACAATTGGCAAGTATGCTAAGTCAGAATTTGACAAATGACCCTATAATTCTCAGATCACC TATTATATAr^rr T . 

TCTCTCTTCTGTTAACCGTTCATACGATTCAGTCTTAAACTGTnACTGGGATATTAAGAGTCTAGTGGATAATATATAATACGATGTAT ^ 



N 



Arg G,u L y s Ihr |l e Gl y L y s T y r Alo L^ Ser G,u Pne Asp Lys • 

^AAAAAAACTAACAGATATCATGGATAATCTCACAA AAGTTCGTGAGTATCTCAAGTCCTATTCTCGTCT GGATr.A^r.o^rcr.r ' 
ACTTTTTTTGATTGTCTATAGTACCTATTAGAGTGTTTTCAAGCACTCAlAGAGrTCAGGAlAAGAGCAGACCTA GTCCGCCATCCTCTC 



Me, Asp Asn Leu Thr Lys Vo, Arg Glu V Leu Lys Ser V Ser Arg Leu Asp Gln Alo Vo, Gly Glu 

atagatga gatcgaagcacaacgagctgaaaagtccaattatgagttgttccaagaggatggagtggaagag cat actaagccctcttat 

TATCTACTCTAGCrTCGTGTTGCTCGACTTTTCAGCTTAATACTCAACAAGGTTCTCCTACCTCACCTTCTCGTATGATTCGGGAGAATA 2250 



«e asp G,u „e Giu Alo G,n Arg A,o Glu Lys Ser Asn Tyr Glu LeuPhe Gln Glu Asp Gly Vo, Glu Glu H« Thr L y s Pro Ser V 
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TTTCAGGCAGC AGATGATTCTGACACAGAATCTGAACCAGAAATTGAAGACAATCAAGGTTTGTATGCACC AGATCC AGAAGC TGAGCAA 

aaagtccgtcgtctactaagactgtgtcttagacttggtctttaacttcIgttagttcca'aacatacgtggtctaggtcItcgactcgt! 23t *° 



Phe Gin Alo Alo Asp Asp Ser Asp Thr Glu Ser Glu Pro Glu lie Glu Asp Asn Gin Gly Leu Tyr Alo Pro Asp Pro Glu Ale Glu Gin 

gttgaaggctttatacaggggcctttagatgactatgcagatgaggaagtggatgttgtatttacttcggactggaaacagcc tgagctt 
caacttccgaaatatgtccccggaaatctactgatacgtctactccttcacctacaacataaatgaagcctgacctttgkggactcgaa 



2^130 



Vol Glu Gly Phe lie Gin Gly Pro Leu Asp Asp Tyr Ato Asp Glu Glu Vol Asp Vol Vol Pne Thr Ser Asp Trp Lys Gin Pro 



Glu Leu 



GAATC TGACGAGC ATGGAAAGACCTTACGGTTGACATCGCC AGAGGGTTTAAGTGGAGAGCAGAAATCCC AGTGGC T TTCGACGA TT AAA 
^^^ A ^ A ^^GCTCGTACCTTTCTGGAATGCCAACTGTAGCGGTCTCCCAAA TTCACCTCTCGTCTTTAGGGTC ACCGAAAGC TGCT AATTT 2520 



— p. 



Glu Ser Asp G.u H,s Gly Lys Thr Leu Arg Leu Thr Ser Pro Glu Gly Leu Ser Gly Giu Gin Lys Ser Gin Trp Leu Ser Thr He Lys 

GCAGTCCTGCAAAGTGCCAAATACTGGAATCTGGCAGAGTGCACATTTGAAGCATCGGGAGAAGGGGTCATTATGAAGGA GCGCCAGATA 
CGTCAGCACGTTTCACGGTTTATGACCTTAGACCGTCTCACGTGTAAACTTCGTAGCCCTCTTCCCCAGTAATACTTCCT 



CGCGGTCTAT 



2610 



A.o Vol vo. Gin Ser A»o Lys Tyr Trp Asn Leu Alo Glu Cys Thr Pne Glu Alo Ser Gly Glu Gly Vol He Met Lys Glu Arg Gin He 

ACTCCCGATG7AT ATAACGTCACTCC AGTGA TGAACACACATCCGTCCCAATCAGAAGCAG7ATCAGATGTTTGGTCTCTCTCA AACACA 
TGAGGC'C » AC A7 A T AT ' CC AH^'^ ArCTC ACT'AC T VST CTGT AGGC AGGGTT AGTCT TCGT C A T AGTC T ACAAACCACAGAGAGH i C 7 C T 2 



Thr Pro Aso Vol T y r Ly8 Vof Thr Pro Vol Met Asn Tnr H, S Pro Ser Gin Ser Glu A.o Vol Ser A S p Vol 7rp Ser Leu Ser ly , Tnr 



Xba I 



TCCATGACTTTCCA ACCCAAGAAAGCAAGTCTTCAGCCTCTCACCA7ATCCTTGGATGAATTGTTCICA7C7AGAGGACACMCA:c: C: 

aggtactgaaaggttcggttct ttcgttcagaagtcggagagtggtat aggaacct ac Ti aac aagagt agat ctcctctcaagjagaga 2790 



Ser net Thr Pne Gin Pro Lys Lys Alo Ser Leu Gin Pro Leu Thr lie Ser Leu Asp Glu Leu Pne Ser Ser Arg Gly G«u Pne He Ser 
CTCGGAGGTGACGGACGAATGTCT^ 

CAGCCTCCAC TGCCTGCT TACAGAGT ATTTCTCCGGTAGGACGAGCCGGACTC TATGTTTTTC aaC ATC T 7 AG T 



CCGC7CTCAG7T7ATA 



+ 2880 



Vol Gly Gly Asp Gly Arg net Ser His Lys Glu Alo lie Leu Leu Gly Leu Arg Tyr Lys Lys Leu Tyr Asn Gin Alo Arg Voi Lys Tyr 

TCTCTGT AGACTATGAAAAAAAGTAAC AGATATCACGATCT AAGTGTT ATCCCAATCCATTC ATCATGAGT TCCT T AAAGAAGATTC TCG 
AGAGACATCTGATACTTTTTTTCATTGTCTATAGTGCTAGATTCACAATAGGGTTAGGTAAGTAGTACTCAAGGAATTTCTTC7 



2970 



CTAAGAGC 



Ser Leu • 



r 



M 



Met Ser Ser Leu Lys Lys lie Leu 
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3060 





~~ ■ m 

* - ,e Asp Ly s V Pre C, y vo, Asp Glu « te p Thr ^.^^ V , lys ^. !Mo , 

« , r vo , - - Asn - Pro Phe ^ g 7hr - ser - - Ato - ser zT^^-z^r 

GTCCC '^GCALUUAAGATGTTTTAGAACCGAAAAAACCCAAGAAGATl AUAt I I CCGGTGAGGTCGCCATAACCGTCTAGTTCCAGTT^ 



3240 



- * ,s Ar, P r 0 Phe fyr Lys , (e Leu Wo phe ^ 0y &r SertenLeuLysAbThrproAioVo|T::::: ^— 

z z v k. ^ ~ Cys z Gly Ato Tyr LeuPro - :~ - Lys Thr pro fro - Leu ten - ^ - — 

.Sac I 

- *. Asn „. Cly Leu v Lys Cly Ihr lle ciu Leu „ ~ - " ~ Asp tep - s 7^^ — 
^^^^ 

^"-^^ 3S00 
^ Z Z Z Ser Ser Ly s Pne ~ As. Phe ~ - r — - Phe ~ Leu " " - ^ ^ ~ ~ 



A'o 7rp vol Leu Asp Ser lie Ser His Phe Lys 
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Miu 



TCCAGCCTCTCGAACAACTAATATCCTGTCTTTTCTA TCCCTATGAAAAAAACTAACAGAGATCf!ATrT f:TTTflrrr 

aggtcggagagcttgttgattat aggacagaaaagatagggataci ^ttt7gat7gtctctagc7agac aaa7gcgc ac tga7 ac tTca 

r-c — 

Met Lys 

CCC^TACTTACCCTTTTTATTCATTGGGGTGAAnGCA AGTTCACCA.ACTTTTKCACACAACCAAAA A^^ 

cggaaaacatgaatcggaaaaataagt AAcecc ac aacgttcaagtggtatcaaaaaggtgigt tcg t 7 it tcct tic ac ct- 7 "lac 3670 

" ■ G ____ 

Cys Leo Leu 7yr Leu Ale Pfte Leu Phe 11*. riw u->f a~ r ^ " " ' 

He C,y Vo, Asn Cys Lys Pne 7«r , le Vol Pne Pro H, 5 Asn Cm Lys Gly Asn trp Lys Asn 

rK 5 ^A T TACCA7 T AnGCCCG7CAAGCTCA G A77T LA77GGCA7AATGAC77AA T AGGCArA f: r r t 7, r . t . T 

AAGGAA G A77AA TG G7AArAAC G G C CAGT{cGAG T C7AAl777AACC G7l77AC T GAA T :A7CCG7G T C G G7A7G T 7c^TT^ »» 

Vo, Pro Ser Asn Z Z Tyr Cys Pro Ser Ser Ser Asp Leu 7 n l' P H,s Asn Asp Leu He G,y Thr *o Z Z Z Lys Z pro 

tItIa^cTg^^ 

TCTCAGTGTTCClaA I AAGTTCGTCTGCCAACCTACACAGTACGAAG GTT'IACCCAGTGA'TGAACACTAAAGGCGACCATACCTCGCTTCa 

" ^ " C ^ GV - - * - - - Cys Asp P. Arg 7r P 7yr " Pr0 ~ 

! ^A G TCCA7CCGATCC7TCACTCCATCT GT AGAACA A7 G CAA GG AAA G CA7T G AACAAAC G AAACAA f: GA,r TT r. 

1ATATTG7GTCAGGTAG G CTA G GAAGTCAGGTAGACA7CTTG77ACGTTCCTTTC G 7AACTTG7TTGCTTT G 77CCTTGAACCGAC7TAG 4M, ° 

V He 7. G,n Ser ,,e Ar 9 Ser Phe Th r Pro Ser Vo, G ,u G,n t Lys C,u Ser He Z Z 7 hr Lys Z C,y Th r Trp Leu Asn 
^T T A7 G 7G7CC7C77A C CCAAC T AA G 7G T CAA G 7a G 7:gCC77 77AC G 7C G 77AA7 C :a7AC GG G G TGACAC G 7A77 G A G A7 G 77 GG ; 



4320 



4050 



Aso G. V ^ Gly Glu Trp VQ| tepSer Qn p „ e „ e ^ ay LysCysSepAsnV )leCysProThrVo( H , s Asn Ser , hr ^ 

CC G 7AA u AC7CA7AT7CCA G T7TCCC G A7ACAC7AA G A7, GG A G 7AAAG G 7ACC7 G 7AG; GG AA G AA G A G 7C7CCTGCCK7C G A,A C ,; ^ 
*P K,s Ser Asp V Lys Vo, Lys G ,y Leu Cys Asp Ser Asn Leu ^ Ser « Asp , le Thr Phe Pne Ser Gkj Asp Gly C,u Leu Ser 
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CCCTGGGAAAGGAGGGCACAGGGT7CAGAAGTAACTACTTTGCTTATGAAACTGGAG6CAAGGCCTGCAAAATGCAAT 



ACTGCAAGCAT7 



GGGACCCT 



TTCC7CCCGTGTCCC AAGTCTTCATTGATGAAACGAATACTTTGACCTCCGTTCCGG 



ACGTTTTACGTTATGACGTTCGTAA 



4500 



Ser Leu G.y Lys Glu G.y Thr Gly Phe Arg Ser Asn Tyr Phe Ale Tyr Glu Thr G.y G.y Lys AJo Cys Lys net G»n Tyr Cys.Lys H,s 

GGGGAGTCAGACTCCCATCAGGTGTCTGGTTCGAGATGGCTGATAAGGATCTCTTTGCTGCAGCCAGATTCCgTGAATGCCCAGAAGCG7 
CCCCTC AGTCTGAGGGT AGTCCACAGACCAAGCTCTACCG^CTATTCCTAGAGAAACGACGTCGGTCTAAGGGACTTACg'gGTCTTCCC A ^ 



Trp Gly Vol Arg Leu Pro Ser Gly Vol Trp Pne Glu Met Ala Asp Lys Asp Leu Phe Ala AJo Aio Arg Phe Pro Glu Cys Pro Glu Gly 

caagtatctctgc tccatctcagacctcagtggatgtaactctaattcaggacgttgagaggatcttggattattccctctgc 

GT TCATAGAGACGAGGT AGAGTCTGGAGTCACCTAC ATTC AGATTAAGTCCTGCAACTCTCCTAGAACC TAAT AAGGGAGACGGT TC IT T ^ 



— G 



Ser Ser lie Ser Ale Pro Ser Gin Thr Ser Vol Asp Vol Ser Leu lie Gin Asp Vol Glu Arg He Leu Asp Tyr Ser Leu Cys Gin Glu 
GGACCTCG-TTTAGTCTCGCCCACAAGGTTAGAGAGGTCACCTAGAGTCGATAGAACGAGGATTTTTGGGTCCT7GGCC 



1 H 4770 

AGGACCAAAGT 



Thr Trp Ser Lys He Arq A.o Cly Leu Pro lie Ser Pro Vol Asp Leu Ser Tyr Leu Alo Pro Lys Asn Pro G 



ly Thr Cly Pro A;q Pne 



CCATAAICAATGGTACCC-AAAATACTTTGAGACCAGATACATCAGAGTCGA TATTGCTGCTCCAATCCTCTC 

ggtatiagttaccatgggattttatcaaactctcgtctatgtagtctcagctataacgacgaggttagcagagtTcTT 



AA^-MOr. TCCCAATC' 



ACC.-.GCC- ' AC 



J60 



Thr He :«e Asr. Cy .Thr Leu Lys Tyr Phe Glu Thr Arg Tyr lie Arg Vo. Asp He Alo Alo Pro He Leu Ser Arg net V, Z *y Hot 
TCAGTGGAACTACCACAGAAAGGGAACTGTGGGATGACTGGGCACCATATGAAGACGTGGAAATTGGACCCAATGGAGTTrTr.Ar.r. A rr. 



AGTCACCTTGATGGTGTCTTTCCCTTGACACCCTACTGACCCGTGGTATACTTCTGCACCTTT 



AACCTGGGTTACCTCAACAC7CC IGGT 



4950 



He Ser G.y Thr Thr Thr G«u Arg Glu Leu Trp Asp Asp Trp Alo Pro Tyr Glu Asp Vol Glu He Gly Pro Asn G.y Vo. Leu Arg Thr 

GTTCAGGATATAAGTTTCCTTTATACATGATTGGACATGGTATGTTGGACTCCGATCTTCA TCTTAGCTCAAAGGCT 
CAAGTCCT AT ATTC AAAGGAAATATGTACT AACCTGTACCATACAACCTGAGGCTAGAAGTAGAATCGAGTTTCCGAGT 



CAGGTCTTCGAAC 



CCACAAGCTTG 



5040 



Ser Ser Gly Tyr Lys Phe Pro Leu Tyr Met lie Gly He G.y Met Leu Asp Ser Asp Leu H,s Leu Ser Ser Lys Alo Gin Vol Phe Glu 

ATCCTC ACATTCA AGACGCTGCT TCGCAACTTCCTGATGATGAGAGTTTATTTTTTGGTGATAC TGGGCT ATCCAAAAA TCC AATCGA GC 
TAGGAGTGTAAGTTC TGCGACGAAGCGTTGAAGGACTACTACTCTCAAATAAAAAACCACTATGACCCGAT AGGT TTTT AGGTT AGCT 



+ 5130 



CG 



H.s Pro His lie Gin Asp Alo Alo Ser Gin Leu Pro Asp Asp Glu Ser Leu Phe Phe Gly Asp Thr Gly Leu Ser 



Lys Asn Pro He Glu 
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TTGTAGAAGGTTGGTTCAGTAGT TGCAAAAGC TC T ATTCCCTC T TT7 TTCTTTATCATAGCGTTAATCATTn^^r t att n y t *, ^ ^ ^. 



AACATCTTCCAACCAACTCATCAACCTTT 



TCGAGATAAGGGAGAAAAAAGAAATAGTATCCCAATTAGTAACCTGATAAGAACC 



AACACC . 



5220 



""TCGTATCCATCTTTCCATTAAATTAAACCACACCAA CAAAAGACACATTUTACACACAT^CA TCAACrn.rTT^Z^ 

ctcaaccataggtagaaacgtaatttaatttcgtgtggttcttttctgtctaaatatgtctgtatctctacttggctgaacctttcattg 53 '° 



;t Gi - y ^ * » jjj ^ G , y Lys . 1 



Nhel 



TCAAATCCTGCTAGCCAGATTCTTCATGTTTGGACCAAATCAACTTGTGATArr.T.r T r, A ,.,^. T . 



AGTTTAGGACGATCGGTCTAAGAAG 



TACAAACCTGGTTTAGTTGAACACTATGG7ACGAGTTTC7C 



AATTATATTTG AGTTTTTA 



CGGAGTTAAT ATAAACTCAAAAAT 



5400 



ATTTTTATGAAAAAAACTAACAGCAATCATGGAAGTCCAC GATTTTGAGACCGACGAGTTCAATCATTTCA ArGAA.Ar.Arr 

TAAAAATACTTTTTTTGATTGTCGTTAGTACCTTCAGGTGCTAAAACTCTGGCTGCTCAAGTTACTAAAGTTACTTCTACf GATACGGTG 



5190 



«et Qu Vol His Asp Phe Glu Thr Asp Glu Pne Asn Asp Phe ten Glu Asp Asp Tyr. Alo Thr 

TCCCGATGAGCGCATGACGTACTTGAAT CATGCTGATTACAATTTGAA TTCTCCTCTAATTACTGAT^TATTn, 
TTCTCTTAAGGACTTAGGGCTACTCGCGTACT.CATrUAmAp^GACTAATGTTAAACTTAAGAGG^ATTAATCACTACTATAAC; 



AAGAGAATTCCTGAA 



5580 



at, Glu Pne Leu Asn Pro Asp Glu Arc Me, Thr Tyr Leu Asn H,s Ato Asp Tyr Asn Leu Asn Ser Pro Leo He Ser Asp Asp He Asp 
CAATTTGATCAGGAAATTCAATTCTCTTCCGATTCCC TC GATGTGGGATAGTAAGAACTGGGATGGAGT 7C T TGAfiATnT7AflrftTrt.Tr 

gt t aaac t agtcc n 7 aag t t aagagaaggctaagggagct acaccctatcattcttgaccctacctcaagaactc tacaattctagtac 5670 



As. Leu ,, e Ly5 ^ Leu Pro ,„ Pro Ser ne , Vp ^ ^ ^ ^ ^ ^ ^ ^ G u ^ ^ ^ ^ ^ 

TCAAGCCAATCCC ATCTCAACATC TCAGATGCATAAATC CATCGGAACTTGCTTAATCTCTGATAATCAT GATCCf A.T r Aar:rrTAT»r- 

agttcggttagggtagagttgtagagktacgtatttacctacccttcaaccaattacagactattaguctacggtcagttcccatatc 5760 



Gin Aio Asn Pro He Ser TMr Ser Cm Me, H,s Lys Trp Me, Cly Ser Trp Leu Me, Ser Asp Asn H,s Asp A,o Ser G,n Cly ,yr Ser 



Sac II 



TTTTTTACATCAAGTGG ACAAAGAGGCAGAAATAACATTTCACGTGGTGGAGACCTTCATCCCCGGCTG GGGCAACAAACCAAT 
AAAAAATGTACTTCACCTGTTTCTCCGTCTT7ATTGTAAACTGCACCACCTCTGGAAGTAGCCGCCGACCCCGTTGTTTCGTT AACTTAT 5850 



TGAATA 



PMe Leu His G,u Vo, Asp Lys G.u A!o Glu He Tnr Pne Asp Vo, Vo, Glu Thr He ^ G.y Trp Gly Asn Ly S Pro „e Glu Tyr 
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.CATC A MMC C W GaTGG:CTGACTCAT!CAAAATTCTCCrT UTTT C!CTCAAAAGTTTTTCGACTTi>rA r ^ rTTf;trATT 

gtagtttttcctttctacctgactgagtaagttttaagagcgaataaacacagttttcaaaaacctgaatgtgttcaactctaattagaa 59 " 0 

AAATGC7GTCTC T ^ A CCTG G AATTGCTCAACTTGGCGAGGACTTTC AAAGCCAAAGTCACAAGAAGTTCTCATnG Adrr:A 

TTTACGACAGAGACTCCACCT TAACGAGTTGAACCGC TCC TGAAACTT TCCGTTTCAGTCT TC TTCAAGACT ACCTTGCTTCTaTacgTC ^ 



* - v 0I Ser Qu Vo, Glu Leu Leu Asn Leu Mo V 9 T h r P he Ly5 Cly Lys Vol Arg Arg Ser Ser His G,y T h r Asr. Z Cys Arg 
GATTAGGGT TCCCAGC TTGGGTCC TACT TT TAT TTCAGAAGG ATGGGCTTACTTCAAGAAACTTGATA TTnAA tep 

ctaatcccaagggtcgaacccaggatgaaaataaagtcttcctacccgaatgaagttctttgaactataacatiacctggct ttgaaaga 6,20 



-e Arg Vo, Pro Ser Leu Gly Pro Thr P h e „ e S er Gki G,y Trp * V , e lys Lys Leu , p lle „ ^ , g ^ ^ ^ 
£^^T^GT^AAAGATGTGATTATAGGGAGGATGCAAACGGTG CTATCCAT GG TAT G TAGAATAGACAACCTGT7rTr Af; 

caattaccagtttctacactaatatccctcctacctttgccacgatagg'taccatacatcttatc tgttggacaagagtctcgttctgta 62,0 

Leu He, vo, Lys Asp Vo, Ile lle Gly ^ Met Gln Thr „ Leu Ser Met Vol Cys Arg lie Asp Asn Leu P„e Ser C,u C,n Asp „e 

££^T^TCTAAATA T CTACAGAATTG G A G ATAAAATT G T GG A G A GG CA G GGAAArTT.-TCTTAT G ArT T .Ar T ,^ r , rTP(?Atr . 
bAAGAGGGAAGATTTATAGATGTCTTAACCTCTATTTTAACACC TCTCCGTCCCTrTAAAAACAATACTG AACTAATTTTACCACC7TGG ^ 

Ser Leu Leu As. ,„ Tyr A, 9 „. Gly ^ Lys , le Vo , ol^Z^ZZ^Z „e L y s Me, Vo, G,u Pro 

GATATGCAACTTGAAGCTGATGAAATTAGCAAGAGAATCAA GGCCTTTAGTCCCACAATTCCCTCAT TTTGAAAATr AT Tc 

CTATACGTT G AACTTC G ACTACTTTAATC G TTCTCTTAGTTCCGGAAATCAGGGTGT7AAGGGAGTAAAACTTTTAGTATAGTTCTGAAG 

"e Cys As. Leu L V S Leu He, Lys Leu Alo Arg Glu Ser Arg Pr'o Leu Vo, Pr 0 tiin"phe Pro H* Pr* Z Asn H,s „ e Lys T h r Ser 

lEllCATUAGGGGCAAAAATTGACCaAGGTATAAGATTCCT CCATGATCAGATAATGAGTGTGAAAAC Ar.TG.ATr, 

ACAACTACTTCCCCGTTTrTAACTGGCTCCATATTCTAAGGAGGTACTAGTCTATTACTCACACTTTTGlcACCTAGAGTGTGACCACTA " 



+ 6390 
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TATTGATGTGTCATATGC AAAA GCACTTGCAAGTGATTTAGCTCGGATTGTTCTATTTCAACAGTTCAATGATr flTAflfl AarTr 

ataactacacagtatacgttttcgtgaacgttcactaaatcgagcctaacaagataaagttgtcaagttactagtatttttcaccaagca 6660 



I, Asp « Ser Tyr A,o Lys Ala Leu A,o Ser Asp Leo «, Ar, „ „ Leu Phe Gin Gin Phe Asn Asp His Lys Lys, Trp Phe Vol 
GAATGGAGACTTGCTCCCTCATGATCATCCCTTTAAAAGTCATGTTAAAGAAAATACATGGCCCACAGCTGf TrAJiCT TTTT „. 



CTTACCTCTGAACGAGGGAGTACTAGTAGGGAAATTT 



TCAGTACAATTTCTTTTA7GTACCGGGTGTCGACGAGTTCA 



agttctaaaacc 



6750 



*sn G,y ,,o Leu Leu Pro h,s Asp H,s Pro P,e Lys Ser His' Vo, Lys Glu Asn Thr Trp Pro Thr A,o Gln Vo, Gin Asp Pne Gly 

ACAUAATCCCATGAACTKCGC^GArTAAATGTTTTGAAA TACCCGACnACTAGACCCATCGATAA TATArrrTn.r . 

TCA.-TACCG.ACTTGAAGGCGACTAATTTACAAAACTnATGGGCTGAATGATCTGGGTACCTATTAlATGAGACTGnTTCACAAC ™ 0 



He lie Tyr Ser Asp Lys Ser -,. s Ser 



ASP Lys Vo H, fa, Leu P ro Leu ile Lys £ys Pne ^ ||e ^ ^ ^ ^ ^ ^ ^ 

AATGAA:A GG:CAGACCTCTTCAAACAT G TCCGAATaAATCCGAACACTCCTATCCCTAGTAAA AAG E TnTTnr t r:,r 7 ,..r,. 
?!ACi*AtCCACTClCCACAACTTT G 7ACAGGC7TACT7AGGCT TGTGAGGATAGGGATCATT TTTCCAC AACCTCTGA7 



ccacac 



ACAACCTC1C 



■+ 6930 



« ,, Q Ser fa, Vo, Leu Lys H, 5 Vo, Ar 9 „ ^ Pro ^ Thr Pr0 „e Pro Scr Lys Lys to l Le u Cln ltl r ne, t0u ^ -„ r 
.TTCCGA.GCAACCTTTCTTAAAGAArTTCTCTAACTACTCnCCCGAATCTACTAclACTAGATTAATAACCAGAAlT^CTTTCC^ ^ 



Lys A,o Thr Asn Trp Lys G , u Pne Leu Lys G ,u „e .Asp Glo Lys Gly Le u Asp Asp Asp Asp Leu »e » Gly Leu Lys Giy Lys Giu 

£^£^^^^a^gtt GG caggtagatttttctccctaatgtcttgga aattcc G a G aatacttt G taattacccaatat?t^ 

ctcccttcacttcaaccgtccatctaaaaagagggattacagaacctttmcgctcttaIgaaacattaatggcttataaactatttctg 



-9 Glu Leu Lys Leu A,o Gly Aro Pne Phe Ser Leu Met Ser Trp Lys Leo Arg Ctu Tyr Phe Vol Ile Thr Glu Tyr Leu Ile Lys Thr 
lE^^CTATGTTTAAAGGCCTGACAATGGCGGAC GATCTAArTCCAGTCATTAAAAAGATGTTAGATTCCTCATcr^ rr^rr 

agtaaagcagggatacaaatttccggactgttaccgcctgctagattgacgtcagtaatttttctacaatctaaggagtaggccggttcc 72 °° 



ATTGAAGTC ATATGAGGCAATTTGCATAGCCAATCACATTGATTACGAAAAATGGAATAACCACC AAAGGAArTTA 

taacttcagtatactccgttaaacgtatcggttagtgta^taatgctttttaccttatIcgtggtttcctt 



ggcccagt 



caatagtttgccgggtca 



7290 



Leu Lys Ser Tyr Glu A, 0 „ e Cys He A,o Asn H,s He Asp Tyr Glu Lys Trp Asn Asn He Gln V g Lys Leu Ser Asn C,y Pro Vo, 
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G rT,CC G ACrT A TC G acCACTKT TA C C TTATCCATCCTTM TC G AaA C MCTC A TCA A TrTTTT G . GA ^. Tr TT , T , T , rT Gc ' 

caaggctcaatacccggtcaa G aatccaataggtaggaattagctctctt G agtacttaaaaaactcttttcagaat atat gatgttacc 7380 

Phe Ar 9 Vo, « Sly Gln Phe Leu Gly Tyr Pro ^ Uu , (e G L 9 T.H,s G,oPhePhe G , uLy sSer L eone T y r T y r Asn G, y 
AAGACCAGACTTGATGCGTGTTCACAACAACACACTGATCA ATTCAACCTCCCAACGAGTTTGTTGGCAAGGA rAA.A.^^. 

ttctggtctgaactacgcacaagtgttgttgtgtgactagttaagttggagggttgctcIaacaaccgtkctgttctcccacctgacc; 7 " 7 ° 



AGGTCTACGCCAAAAAGGATGGACTATCCTCAATC TACTGGTTATTCAAAGAGAGGCTAAAATCAGAA ACACTGCTGTCAAAGTCTTGGr 

tcgagatgccgtttttcctacctgataggagttagatgaccaataagttktctccgatIttagtctttgtgacgacagIttcagaaccg 7550 



G, y Leo Ar 9 Gin L y s G, y Vp Thr l,e Leu as„ Leo Leo Vo, He G,n V 9 G, u A!o L y s lie Ar 9 As. Thr a, 0 Vo, L y s Vo, Leo A,= 

ACAAGGTGATAATCAAGTTATTTGCACACAGTATAAAACG AAGAAATCGAGAAACGTTGTAGAATTACAGCCT GrTrTr A A 

TGTTCCACTATTAGTTCAATAAACGTGTGTCATATTTTGCTTCTTTAGCTCTTTGCAACATCTTAATGTCCCACG AGAGTTAGTTrACC : 7550 



Gin G, y Asp Asn G,n Vo, „e C y s Thr G,n , y r Ly s Thr Ly5 Ly s S er Ar, Asn Vo, Vo, G, 0 Gln Gly Al0 Leu ^ Gln ^ fa| 
— ^^AATGAGAAAATTATCACTGCAATCAAAA TAGGGACAGGGAAGrTAGGACTTTTGAT.MATGAC GATGAGAClATGC/ ' * t l'G*L 

aa G a I ,attactcttttaatactgacgttagttt 1 atccctg7cccitcIatcctgaaa1ctatttactgctactctcaucgttacacg ,7 '' 0 



Ser Asn Asn G , u Ly s „ e ne( Thr a1o |le Ly£ „. c , y ^ &y ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ ^ 
AGATT ACTTGAA TT A7GGAAAAATACCGATT T TCCG TGGAGTGAT T AGACGGTTACAGACCAAGAGATCGTC ACGAGTCACT TGTCTCAC 

tctaatgaacttaatacctttttatggctaaaaggcacctcactaak.cccaatctctggttctctaccagt gctcactgaacacagtg 7 "° 



Asp V Leo Asn V G, y L ys „e Pro He Phe Arc G, y Vo, .,e Ar 9 Gly Leu G, u Thr Lys ^ Trp Ser V9 Vo , Jhr Cys VQ , , 



Sac I 



CAAT G AC CAAATACCCACTTGTGCTAATATAATGACCKAGTTTCCACAAATGCTCKACCGTAGCTCATTT TGCTGAGAACCCAATCAA 

gtt ac7ggtt7atgggtgaacacgattatattactcgagtcaaaggtgtttacgagagtggcatcga G taaaacgactcttgggttagtt 7920 



Asn Asp G,n He Pro Thr Cy s A,o Asn Ile Het Ser Ser Vo, Ser Thr Asn Ate Leo Thr Vo, A,o H,s Pne A,o G,o Asn Pro „e Asn 
1GCCAT G ATACAGTACAATTA7TTTGGGACATTTGC TAGACTCTTGTT G ATGA T GCATGATCCTGCTCTTCGTC AATCATTGTATGAAGT 

acggtactatgtcatgttaataaaaccctgtaaacgatctgagaacaactactacgtactaggacgagaagcagttagtaacatacttca 



8010 



Ate Met Ile C,n T y r Asn T y r Phe G. y Thr Phe Ate Ar 9 Leo Leo Leu Z Z H. Asp Pro Ate Z ^ Z Ser Z T y r 



Glu Vol 
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rCAACATAAGATACCG GGC^TGCACACTTCTACTTTCAAA TACGCCATHTTftTATTT rmf 

ACTTCTATTCTATGCCCCGAACrTrTr^r*^ ' 1 ' TCCAT |CCACGACTCTC CCCCATC7C 

ccccgaacgtgtcaaga tgaaagtttatgcggtacaacataaacctgggaIggtaacctcctcacagcccctac^ 8,00 

z z z z z g, Leu ; s z s. z z ws ^ Leu - LeuAsp P — ^ zzzzzzz^ 

^Z££!^I!III£^ ^CCCAGA T CCCG T AC A TGI ,, , 

^^^^ 




5370 



GCGATACCCTTACTCAGGTCGCTTGAACAATTTCTGACTCCAATTTTTtACGAATT AGCTTAGTTCTGTT TGGTAGTC 

^ Leu A S n , le A.o Me, G , y Me, Ser Pro Alo Asn Leu Leu Lvt Z Z vc Lys Lv S C yS Le u „ a I ' 

■ ™ >ys Lys Lys Leu lie Glu Ser Arg Gin Thr ]| e Ar- 

£^A£CAGG7GATT AAGGATGCAACCATATATTTGTATr ATr^A APft nr » TCQGCTC ' T 1 ftGTTl 

CTTGGTCCACTAAT TCCTACGTTrrTAT.T^.. , ' " 1 ' *, ^^GCTCAATAAATCCTCTCTTrrr 

ttcctacgttggtatataaacatagtact tctcctagccgagtcttcaaagaaTac^a^ tatttacgagacaagsg ^ 0 
aagt ttagtccgtgaaaaaaccctcagcgtctgcccgagtagtcagataaagttt taagagcatgat aaccct 5550 




+ E6U0 



— _ w 7 wrs> 1-1,5 Le 

^^Tl. „e t Trp Thr C yS Z Z Z ZTL Z Z L eu Arg y L yS Z ZZZ^ZZZZ^Z 
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AACTGTACCCCATCCAT 



TAGAAATGTTGGGTCCACAACATCGAAAAGAGACTCCTTGTGCACCATGTAACACATCAGGG TTCAATTATGT 

ttgacatggggtaggtaatctttacaacccaggtgttgtagcttttctcIgaggaacacgtggtacattgtgtagtcccaagttaataca 8820 



Thr Vol Pro H,s Pro Leu Glu net Leu Gly Pro Gin H,s Arg Lys Glu Thr Pro Cys Alo Pro Cys Asn Thr Sen Gly Pr* A S n Tyr Vol 

Bam HI 



T ^TCTCCATTGTCCACACCGCATCCATGACCTCTTTAGT7CACGCGGA CCATTGCCTGCTTATCTAGGGTCTAAAACATrTnAA7rTAf 

" h 

AGATG 



a*gacacg:aacagctcteccctagctactccagaaatcaagtgcccctgctaaccgacgaatagatcccagattttgtagact7 * 89,0 



Ser w ... s Cys Pro Aso C: y He His Asp Vol Pne Ser Ser Arg Gly Pro Leu Pro Alo Tyr Leu Gly Ser Lys Thr Ser G: u ,Ser Thr 
AT C TAT ' '. *CC ACCC j 7GCGAAACGGAAACC AAACTCCCAC TGAT 7AAAAGAGC7 AC ACGTC T 7 AGAGATGC 7 A TC 7 C 7 T GG T ? * C ' ' GA 

-agataaaacg:cccaaccctttccctttcgtttcagcc7gactaa 1 tttctcgatgtgcagaatctctaccatagacaaccaaa C aac' ; 9000 



Ser Ho Leu Cn Pro Ir 0 Glu Arg Glu Ser.Lys Vol Pro Leu lie Lys Ar 9 Alo Thr Arg Leu Arg Asp *, 0 He Ser TroPne Vol Cu 

ACCCGACTCTAAACT ACCAATGACTATACTTTCTAACATCCACTCTTTAACAGGCGAAGAATCGACCAAAACCCAGCATGGGTiCAAAAC 
TGGGC 1CAGA7T 7 GA7 CGTTAC TGATATGAAAGAT TG7AGGTGAGAAAT TGTCCGCTTC T T ACC7GGT7 TTCCGTCG T ACCC AAG 7 T 7 7 C ^ 



ft- Leu Alo Met Thr lie Leu Ser Asn He H,s Ser Leu 7hr Gly Glu Glu 7rp Thr Lys Arg Gin H,s Gly Pne Lys .ru ' 

AACAGGGTCTGCCCTTCATAGGTTTTCGACATCTCGGATGAGCCATGGTGGGTTCGCATCTCAGAGCACTGCAGCATTGACCAGG TTGAT 

ttgtcccagacgggaagtaIccaaaagctgtagagcctactcggtaccacccaagcgtagagtctcgtgacgtcgtaacIggtccaacta 9180 



Thr Gly Ser Alo Leu His Arg Phe Ser Thr Ser Arg Met Ser His Gly Gly Phe Alo Ser Gin Ser Thr 



Alo Alo Leu Thr Arg Leu Met 



GGCAAC T AC AGAC ACCATGAGGGATCTGGGAGATCAGAATTTCGACTTTTTATTCC AAGCAACGTTGCTCT ATGC TCAAATT 
C «TTGATGTCTG"TGGTACTCCCTAGACCCTCTACTCTTAAAGCTGAAAAATAAGGTTCGTTGCAACGAGATACGAGTTT 



ACCACCAC 



AATGGTGGTG 



9270 



Ato Thr Thr Asp Thr Met Arg Asp Leu Gly Asp Gin Asn Phe Asp Phe Leu Phe Gin Alo Thr Leu Leu Tyr Alo C.n He Thr Thr Thr 

TGTTGCAAGAGACG GATGGATCACCAGTTGTACAGATCATTATCATATTGCCTGTAAGTCCTGTTTGAGACCCATAGAAGAGATCACCCT 
ACAACGT ^^^^^GCC TACC TAGTGGTCAACATGTCT AGTAAT AGT ATAACGGAC ATTC AGGACAAACTCTGGGT ATCTTC TCT AGTGGGA 



Vol A.o Arg Asp Gly Trp Me Thr Ser Cys Thr Asp H,s Tyr H,s lie Alo Cys Lys Ser Cys Leu Arg Pro He Glu Glu lie Thr Leu 

GGACTCAAGTATGG ACTACACGCCCCCAGATGTATCCCATGTGCTGAAGACATGGAGGAATGGGGAAGGTTCC TGGGGACAAGACAT 
CCTGACTTCA7ACCTGATGTGCGGGGGTCTACATAGGGTACACGACTTCTGTACCTCCTTACCCCTTCCAAGC 



AAA 



ACCCCTGTTCTCTATTT 



9450 



Asp Ser Ser hel Asp Tyr Thr Pro Pro Asp Vol Ser H,s Vol Leu Lys Thr Trp Arg Asn Gly Glu Gly Ser Trp Gly On Giu !«e Lys 
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ACAGATCTATCCTT7 AG 



^B^CCCAATTCGAAGAATTTAGCACCTGCTGAGCAATCCTATCAACTCCRr^ATPTAT.rrr 



tctctacataggaaatcttcccttaaccttcttaaatcctggacgactcgttaggatagItcacccgtcIac 



TTTCTATATGG 



ATATCCAAAAGATATACC 



9540 



AGACTTGGCGTATAGAAAATCTACTCATGCCGAGGACAGT 7CTCTATTTCCTCTATCTATACAAGGTCGTAT TAf:Af:f:Tr^Ar 

""^CCCATATCTTTTAGATCAGTACCaCTCCTGTCAACAGATAAAicABATAeAmCTTCCACciTAATCTCCAicTCCAAAC*: ^ 



AAAAGGGTTGCTAGACGGATTAATGAGAGCAAGTTGCT GCCAAGTAATACACCGGAGAAGTCTGGCTCATT TGAAnAr:f:rr^^'rr a 
TTTTCCCAACGATCTGCCTAATTACTCTCGTTCAACGACGGTTCATTATGTGGCCTCTTCAGACCGAGTAAACTTCTCCGGCCGGTTGCG 



9720 



L y s G, y Leu Leu Asp Cl y Leu Het Ar, A,o Ser C y s Cys G,n Vo, „e H,s Aro Arg. Ser Leu A,o H,s Leu L ys Aro. Pro A,o Asn A,= 




AT *^* A ^^ A ^^CCCACAAGATCCCAACCTCCT ATCCGACAAGCAACCGTGA7ATGGGGGTGATT GTTA G aaat T a r t 



iaatctttgctaagggctcttctagggttggaggataggctgttcgttcgcactata cccccactaacagtctttaatgaagtttatggt "°° 



Leu ft, Tnr He Pre H,s L y s He P ro T h r Ser T y r Pro T h r Se, Asn A, Asp Me, G, y Vo, ,„ vc Ar 9 Asn V Pne Ly s T y r G,n 




10080 



C Ar 9 Leu ,„ Glu Lys Gly Lys lyr ^ g Sef H , s , yr 3^ uu ^ ^ ^ ^ ^ ^ ^ ^ 
ATTCTCT AT TTCCACCACCCTCTTGCAAATCCTAT AC AAGCC AT TTTTATC7GGGAAAGATAAGAAT G AGT TGAn AH fipnrrr T 

taauagataaaggtggtgggagaacgtttaggatatgttcggtaaaaat^accctt tcuttcttactcaactctctcgaccgtttagI 

Pne Se, „. Ser T h r T hr Leo Leu Cln lle Leu Tyf Lys Pro Ph L e Leu Ser ~ Lys ,3,, Lys ^ - ~ ~ ~ ^ 
TTCnCATTGCTAA G ATCAGGAGAGGGGT G GGAA G ACA TACAT G TGAAATTCTTCACCAAGGACATAT TATTr.TCT r r^^ q 

aagaagtaacgattctagtcctctccccacccttctgtaIgtacactttaagaagtggttcctgtataataacacaggtctcctttagtc ,0 ' 7 ° 



Ser Ser Leu Leu Ar 9 Ser G, y Gki G, y Tr P G,u Asp He H,s Vo, L y s Pne Phe Tnr L y s Asp He Leu Leu C y s Pro G.u G,u he Arc 



14/26 

FIG. 1 (CONT.) 

BNSDOCID: <WO 9634625A1 I > 



WO 96/34625 



PCT/US96/06053 



ACATCC 7 TGCAAG7 7CGGGA7 7CC ^ AA GGA7AA7AA7AAA G AC A7GAGC 7 A7CCCCC77GGGGAAGG GAA7rr AEamr? ar ^ ft TT • 

tgtacgaacgttcaagccctaacgattcctattattattIctgtactcgatagggggaaccccttccctuggtctcccIgttaatgttg 10260 

H,s A,o Cys Lys Pr,e Gly »e A,o Lys Asp Asn Asn Lys Asp Met Ser 7yr Pro Pro 7rp Gly 7 9 Glu Ser Ar, Gly Z Z 7nr 7 h r 
AATCCCTGTTTATTATACGACCAC C CC TTACCCAAAGATGCTAGAGATGCCTCCAAGAATCCAAAATC CCCTGrTrTrrrrr 

ttaggcacaaataatatgctggtggggaatgggtttctac G atctctacggaggttcttaggttttaggggacgacaggcct7agtccaa 10350 



CGGCCAATTACCAACTGGCGC7CATTATAAAATTCGGAGT ATATTACATGCAATGGGAATCCATTACAG GGArTTrTTr:flrTTrTrr 

cccggttaatggttgaccgccagtaatattttaagcctcatataatgtaccttacccttaggtaatgtccctgaagaactcaacacctct 



G, y On Lou Pro Thr G,y A,o H,s T y r Lys He Ar 9 Ser „e Leu His G V Met Gly U. H is Tyr -9 Asp Pne Leu Ser Cys Gly Asp 




Nar I 



10620 



,-,^O.GG:G„TCTCCTGAGCCCCCCAGTGCCCTAG AAACTTTAGGAGGAGATAAATCGAGATGTGTA AAT f: . Tgt 

CTACGCrcCGCGGAGAGGACTCGGGGGGTCACGGGATCTlTGAAATCCTCCTCTATTTAGCTCTACACAlTTACCACTT^TACAACCc;' 

ne, V 9 G; y A,o Ser Pro Glu Pre Pro Ser A,o Leu G,u 7nr Leu Gl y G,y Asp Lys Ser Ar 9 Cys Vo, Asn Gly Glu Thr Cys Irp Clu 

A767CCA7C7GAC71 A7G 7GACCC AAGGAC ^ ^GGGAC T ATTTCCTCCGACTCAAAGCAGGCTTGGGG CT TCAAATTTA TTT 

TArACC-ACACGAATACACTGGGTTCCTGAACCCrGATAAAGGAGGCTGAGTTTCGTCCGAACCCCGAAGTTTAACTA^ATTAACATTA ' 07 '° 



V Pro Scr AS0 Leu Cys Aso Pro Ar 9 1h r 7rp Asp 7 y r PHe Leu Ar 9 Leu Lys A,o G,y Leu G,y Leu G.n „e Asp Leu He Vo, He, 
GGA TA7 GCAA C7 T CGCG A T I C T 7C 1 ACT ACCC7GAAAA7 7GAGACGAA7G7 7ACAAATTATC7CCACCGGA7 T 7 7 GGA7GACCA AGGAC t 
CCTA7ACC77CAAGCCC 1 AAGAAGA7GA7CGGACT777AAC7G7GC7,ACAA7C7„AAUCACG7GGCC7AAAACC7AC7CG7,CC7CA ™°° 



ASP «e« G,u Vo, Ar 9 Asp Ser Ser 7hr Ser Leu Lys He Glu 7hr Asn Vo, Ar 9 Asn 7 y r Vo, H,s V 9 He Leu Asp G,u G,n G,y Vo, 



Sal ! 



^A i CUCAAGAC77A7G G AACA7A7A T T7GTGAGAGCGAAAAGAATGCAG7AACAA7CC77GG7CCCA7G77CAAGArGn7r^r T , 



AAA77AGA7G77C7GAA7ACC77G7A7A7AAACAC7C7CGC77T7C7TACG7CA77GT7AGGAACCAGGG7AC 



AAGTTCTGCCAGCTGAA 



+ 10890 



Leu lie V Lys Thr Tyr Gly Thr Tyr He Cys Glu Ser Glu Lys Asn Ala Vol Thr He Leu Gly Pro het Phe Lys Thr 



Vol Asp Leu 
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AG 



TTCAAACACAATTTACTACTTCTCAAACGTCTCAAGTATATATGGTATCTAAABGTTTCAACA^TTA.TrrATr^rr 



CAATCCCGA 



TCAAGTTTGTCTTAAATCATCAAGAGTTTGCAGACTTCATAT 



ATACCATACATTTCCAAACTTCTTTAATTAGCTACTTGGGTTAGGGC" 



10980 



Vol Cm Thr Glu Phe Ser Ser Ser Gin Thr Ser Glu ,Vo> Tyr flel Vol Cys Lys Gly Leo Lys Lys Leu He Asp G.u Pro Asn Pro Asp 

TTGGTC T TCCATCAA7GAATCCTGGAAAAACC TGTACGCATTCCAGTCATC AGAACAGGAATTTGCCAGAGCAAAGAA GGTT AGT AC AT A 
AACCACAACCTAGT7ACTTACGACCTTTTTCGACATGCGTAAGGTCAGTACTCTTCTCCTTAAACGGTCTCCTTTCTT 



CCAATCATGTAT 



1 1070 



TroSer Ser He Asn G!u Ser Trp Lys Asn Leu Tyr Alo Pne Gin Ser Ser Glu Gin Glu Phe A.o Arc, Mo Lys Lys Vol Ser Tnr I /r 
CTTTACCTTGACAGGT ATTCCC tcccaattcattcctgatccttttgtaaacattgagactatgctacaaat ATTCCGAGT ACCC ACGGG 

GAAATCCAACTGTCCATAAGGGAGGGTTAAGTAAGGACTAGGAAAACATiTGTAACTCTiATACGATGTnATAACCCTCAICGCrGCCC 



+ MI60 



Phe Thr Leu Tnr Gly ,le Pro Ser G.n Phe He Pro Asp Pro Phe Vol Asn lie Glu Thr Met Leu G.n He Pne Gly Vol Pro Thr Gly 



TGTGTCTCATGCGGCTGCCTTaaaATCATCTGATAGACCTGCAGATTTATTGACCATTAGCCTTTTTTATATGGCGATTATa TrnTftTTA 



1 1 I 1 j j , j .n,.np,, M | U|)Ll)AI|fl|A I LGTATTA 

aca cagagtacgccgacggaattttagtagactatctcgacgtctaaataactggtaatcggaaaaaatataccgctaa!ataccataa! 



+- U250 



Vol Ser h. AJo A.o A.o Leu Lys Ser Ser Asp Arg Pro Alo Asp Leu Leu Thr lie Ser Leu Phe Tyr Met Alo lie i.e Ser Tyr Tyr 

TAA CATCAATCATATCAGAGT AGGACCGATACCTCCGAAr riCCCA'rr.AGA TGG/'. * f ~"C ACAAAATGTGGGGATCGC TAT AACTGGTAT 
ATTGTAGTTAGTATAGTCTCATCCTGGCTATGGAGGCTTGGGGGGTAGTCTACCTTAACGTGTTTTACACCCCTA 



+■ ' r 11340 

AGCGATATTGACCATA 



Asn lie Asn His lie Arg Vol Gly Pro He Pro Pro Asn Pro Pro Ser Asp 
WnD III 



Gly lie Alo Gin Asn Vol Gly lie Alo He Thr Gly lie 



aagcttttggctgagt ttgatggagaaagacattccactatatcaacagtgtttagcacttatccagcaatcattcccga ttaggtggca 
ttcgaaaaccgactcaaactacctctttctgtaaggtgatatagttgtcacaaatcgtcaataggtcgttagtaagggctaatccaccci ' ,<)30 



Ser Phe Trp Leo Ser Leu He, Glu Lys Asp lie Pro Leu Tyr Gin Gin Cys Leu Alo Vol lie Gin Gin Ser Pne Pro Me Arg Trp Glu 

cgctgtttcactaa aaggaggatacaagcagaagtggagtactagaggtcatcggctcccaaaagatacccgaacttcagactc cttggc 

CCGACAAAGTCATTTTCCTCCTATGTTCGTCTTCACCTCATGATCTCCACTACCCGAGGGTTTTCTATGGGCTTGAACTCTGAGGAACCG " 52 ° 



Leu Aio 



Ak, Vbl Ser vol Lys Gly Gly Tyr Lys Gin Lys Trp Ser Thr Arg Gly Asp Gly Leu Pro Lys Asp Thr Arg Thr Ser Asp Ser 
CCCAATCGGGAACTGGATCAGATC TCTG6AATTGGTCCGAAACCAAGTTCGTCTAAATCCATTC AATGAGATCT T f:T TC Lclr &r.n *Tr 



GGGTTAGCCCT TGACC TaCTC TAGAGACCTTAACC AGCCTT TGGTTCAAGCAGATT TAGGTAAGT T ACTC T 
- _ 



1 : — *— 1- 11610 

AGAAC AAGT TAGTCCATAC 



Pro He Giy Asn Trp |, e Arg Ser Leu Glu Leu Vbl Arg Asn Gln Vol Arg Leu Asn Pro Pne Asn Glu He Leu Phe Asn Gin Leu Cys 
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TCGTACAGT 



GGATAATCATTTGAAATGGTCAAATTTGCGAAGAAACACAGGAATGATTGAATR^flTr aatac 



AGCATGTCACCTA 



ACGAATTTC AAAAGAAGA 



ttagtaaactttaccagtttaaacgcttctttgtgtccttactaacttacctagttatctgcttaaagttttcttct 1 1700 



Arg Thr Vol Asp Asn His Leu Lys Trp Ser Asn Leu Ar 9 Arg Asn Thr Gly Met lie Glu 7rp He Asn Arg Arg ,,. Ser Lys Glu Asp 
CCGGTCTATACTGATGTTGAAGAGTGACCTACA CGAGGAAAACTCTTGGAGAGATTAAAAAATCATGAGGAGACTCCA AACTTTAAGTAT 

ggccagatatgactacaacttctcactggatgtgctccttttgagaaccIctctaattttttagtactcctctgaggttIgaaattcat. 11790 



Arg Ser ji e Leu Met Leu Lys Ser Asp Leu H.s Glu Glu Asn Ser Trp Arg Asp • 



At! II 



GAAAAAAACTTTGATCCTTAAGACCCTCTTGTGGTTTTTATTTTTTATCTGGTTTTGTGGTCTTCGTGGGTr.nr.T^r AT r T 



CCACCT 



CTTTTTTTGAAACTAGGAATTCTGGGAGAACACC 



AAAAATAAAAAATAGACCAAAAC ACCAGAAGC ACCCAGCCGT 



ACCGT AGAGGTGGA 



1 1880 



RB2- 



Rsr II 



Tth 



Apal Sac II 

CCTCGCGGTCCGACCTGGGCATCCGAAGGAGGACGTCGTCCACTCGGATGGCTAAGGGAGGGGCCCCCGCCGGGCTGCTAACAAAnrrr 



AGGCTTCCTCCTGCAGCAGGTGAGCCTACCGATTCCCTCCCCGGGGGCGCCCCGACGATTGTTTCGGGC 
RBZ — — 



CCG 

*■ 1 1970 




T7 Ter 



AAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATAACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGA GGGGTTTTTTGCT 
TTTCCTTCCACTCAACCGACGACGGTGGCGACTCGTTATTGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAAC7CCCCAAAAAACGA ^ 



77 Ter 



.Spel Sac I 

CAAAGGAGCAACTATATCCGGATCGAGACCTCGA^^^ 



CTTTCCKCTTGATATAGGCCTAGCTCTGGAGC7A 
■ T7 Ter- 



TTCGAGC 



TGATCACGCCACCTCGAGGTCGAAAACAAGGGAAATCACTCCC 



AATT AAAGCTCG 



12150 



TTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAAC AT 
AACCGC ATTAGTACCAGT ATCGAC AAAGGACACACTTTAACAATAGGCGAGTGTTAAGGTGTGTTG 



ACGAGCCGGAAGCATAAAGTG7 
-H h f , 



TATGCTCGGCCTTCGTATTTCACA 



r 12240 



AAAGCCTGGGGTGC CTAATGAG7GAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAAC 

tttcggaccccacggattactcactcgattgagtgtaatIaacgcaacgcgagtgacgggcgaaaggtcagccctttggacagc, 



CTGTCGTGCCAG 



12330 



ACGGTC 
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CTGCATTAATGAATCGGCC AACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCG 

1 1 1 ! ^ 1 ' — - — 1 i ■ ■ l ' 1 ' 1 1 f- J2420 

GACGT AATTACTTAGCCGGTTGCGCGCCCCTCTCCGCCAAACGC ATAACCCGCGAGAAGGCGAAGGAGCGAGTGACTGAGCGACGCGAGC 

GTCGTTCGGCTGCGGCGAGCCGTATCAGCTCACTCAAAGGCGGTAA7ACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATG 

1 1 1 1 1 1 1 1 1 1 ' i 1 1 1 i 1 h 12510 

CAGCAAGCCGACGCCGCTCGCCATAGTCGAGTGAGTTTCCGCCATTATGCCAATAGGTGTCTTAGTCCCCTATTGCGTCCTTTCTTGTAC 

TGAGC AAAAGGCC AGCAAAAGGCCAGGAACCGT AAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCC TG ACGAGC ATCAC 
1 1 1 1 ' 1 ' 1 1 1 ' 1 < 1 1 1 1 f. 12 600 

ACTCGTTTTCCGGTCGTTTTCCGGTCCTTGGC ATTTTTCCGGCGCAACGACCGCAAAAAGGTATCCGAGGCGGGGGGACTGCTCGT AGT G 

AAAAATCGACGCTCAAGTCAG AGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCT 

1 1 1 1 1 1 1 1 ' 1 ' 1 — 1 i ' I 1 i- 12590 

TTTTTAGCTGCGAGTTCAGTCTCCACCGCTTTGGGCTGTCCTGATATTTCTATGGTCCGCAAAGGGGGACCTTCGAGGGAGCACGCGAGA 

CCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTC7CATAGCTCACGCTGTAGGTAT 

1 1 1 1 1 1 I 1 i 1 i 1 i , 1 , h 12780 

GGACAAGGCTGGGACGGCGAATGGCCTATGGACAGGCGGAAAGAGGGAAGCCCTTCGCACCGCGAAAGAGTATCGAGTGCGACATCCATA 

CTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTAT 

' 1 1 1 : — ' 1 ' H 1 1 1 i 1 1 , j , h 12870 

GAGTCAAGCCAC ATCC AGCAAGCGAGGT TCGACCCGACACACGTGCTTGGGGGGCAAGTCGGGCTGGCGACGCGGAAT AGGCC ATTGAT A 

CGTC T TGAGTCCAACC CGGT AAGACACCACT T ATCGCCACTGGCAGCAGCCACTGGT AACAGGATTAGCA6AGCGAGGT ATG T AGGCGGT 

1 I 1 1 ' 1 1 i 1 1 1 1 ' 1 i 1 1- 12960 

GCAGAACTCAGGTTGGGCC AT 7CTGTGCTGAATAGCGGTGACCGTCGTCGGTGACCATTGTCCTAATCGTCTCGCTCCATAC A7CCGCCA 

CC7AC AGAGTTC7T GAAGTGGTGGCC7AAC7 ACGGC" ACACTAGAAGGACAG7ATT7GGTA7CTGCGCTCTGCTGAAGCC AG T7 ACCTTC 

• I ; 1 ' i ' 1 1 I ' f 1 h> 1 i 1 h 13050 

CGA7CTCTCAAGAACTTCACCACCGGA77GA7GCCGA7GTGA7CTTCCTGTCATAAACCA7AGACGCGAGACGACT7CGG7CAA7GGAAG 

GGAAAAAGAGT7GG7 AGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGG7GGTTTTT7TGTT7GCAAGCAGCAGAT7ACGCGCAGA ■ 

■* i ' i ' 1 I 1 H ' 1 i 1 i ■ h j 3 J 40 

CC7TT7 7C7CAACCATCGAGAAC7AGGCCGTT7G777GGTGGCGACCA7CGCCACCAAAAAAACAAACGT7CGTCG7CTAA7GCGCG7C7 

AAAAAAG GATC7CAAGAAGATCCTTTGATC7T7TC7ACGGGG7C7GACGCTCAGTGGAACGAAAACTCACGT7AAGGGAT7 7TGG7CA7G 
1 I > 1 ' I ' 1 1 1 ' : ' r— 1 i 13230 

TTTTTTCC7AGAGT7CTTC7AGGAAAC7AGAAAAGA7GCCCCAGAC7GCGAGTCACCTTGCTTT7GAGTGCAAT7CCCTAAAACCAG7AC 

AGATTA7CAAAAAGGATCT TCACC7 AGATCCTTT T AAATTAAAAA7GAAGTTTT AAATCAATC7 AAAGTA7ATATGAGTAAACT7GG7 CT 

1 1 1 i 1 h ' 1 1 i 1 1 ' 1 1 : f- 13320 

TC7AAT AGTTTT7CCT AGAAGTGGA7C7 AGGAAAATT7 AATTTT7 ACTTCAAAATTTAG7TAGATTTCATAT ATACTC ATTTGAACCAGA 

GACAG7TACCAATGCT7AA7CAGTGAGGCACCTA7CTCAGCGATCTG7C7ATTTCGTTCATCCA7AGTTGCC7GACTCCCCGTCG7G7AG 

1 1 ' H i 1 i 1 1 1 1 1 1 1 i- 13410 

CTGTCAATGGT7ACGAATT AGTCACTCCG7GG A7 AGAG7CGC7AGAC AGAT AAAGCAAG7AGG7 ATCAACGGACTGAGGGGCAGCACATC 
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ATAACTACGATACGGGAGGGCTTACCATCTGGCCCC 



TAT7GATGCTATGCCC 



AGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCA 

' ' ' i — i i 1 , u 



TCCCGAATGGTAGACCGGGGTCACGACGT7ACTATGGCGCTCTGGGTGCG 



AGTGGCCGAGGTCTAAATAGTCGT 



13500 



ATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCf CCATCCAGTCTATTAATTGTTCCrnnnAA^r T 



TATTTGGTCGGTCGGCCTTCCCGGC 



TCGCGTCTTCACCAGGACGTTGAAATAGGCGGAGGTAGGTCAGATAATTAACAACGGCCC 



TTCGA 



13590 



AGAGTAAGTAG 



nXGCCAGJTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGT 



TCTCATTCATCAAGCGGTCAATTATCAAACGCGTTGCAACAACGGTAACGATGTCCGTAGC 



CACGCTCGTCGTT7GGTATGGCT 



ACCACAGTGCGAGCAGCAAACCAT ACCGA 



13680 



TCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGCTrrTrr^ATr 



AGT AAGTCGAGGCCAAGGGTTGCTAGTTCCGCTCAATGTAC7AGGGGGTACAAC ACGTTTTTTCGCCAATCGAGGAAGCCAGGAGGCTAG ^ 



G77GTCAGAAG7AAG77GGCCGCAG7GT7A7CAC7C 



ATGG77A7GGCAGCACTGCATAATTCTCT7AC7G7CA7GCCA7CCG' 



CAACAG7C77CA77CAACCGGCG7CACAA7AG7GAG7ACCAATACCG7CG 



7GACG7ATTAAGAGAA7GACAG7ACGG7AGGCA77C7ACG 



13860 



T7 



77C7G7GAC7GG7GAG7AC7CAACCAAG7CAT7C7GAGAATAG7G7A7GCGGCGACCGAGT7GC7CT7GCCC GGCG 

7CA7GAGTTGG77CAGTAAGACTCT7A7CACATACGCCGC7GGC7CAACGAGAACGGGCCGC 



7CAATACGGGAT 



AAAAGACACTGACCAC 



AGT7A7GCCC7A 



13950 



AAT^uuuC gtu^A i A^CaGAAi i y ^AAAG7GC7CA7CA77GGAAAACG77C77CGGGGCGaaaau i U7CAAGGA7C77ACC GC7G77G 
77A7GGCGCGG7G7A7CG7C77GAAA7777CACGAG7Ag!aACC7T77GCAAGAAGCCCCGCT7T7GAGAG77CC7AGAA7GGCGACAAC ^ 



AGA7CCAC77CGA7G 7AACCCAC7CG7GCACCCAAC7GA7C77CAGCA7C7TTTACTT7CACCAGCG777C7GGG7GAGCAAAAACAGGA 
7CTAGG7CAAGC7ACA77GGG7GAGCACG7GGG77GAC7AGAAG7CG7AGAAAA7GAAAG7GGTCGCAAAGACCCAC7CG77T7TG7Ccl 



14130 



AGGCAAAA7GCCGCAA 



AAAAGGGAAT AAGGGCGAC ACGGAAATG77GAATAC7CA7 AC7C7TCC7T777CAA7A77 A77HAARP ATT T a t 



7CCG7TTTACGGCG7T7TTTCCCT7A77CCCGC7G7GCC777ACAAC77A7GAG7A7GAGAAGGAAAAAGT7 



A7AA7 AAC77CG7 AAA7A 



U220 



C AGGGTT AT TGTC TCATGAGCGGATACAT ATT 7GAATG7 ATT7AGAAAAATAAAC AAA7ACGGG TTCCf^rnr AT A T T 



G7CCCAAT AACAGAG7AC7CGCCT A7GTATAAAC T7 ACA7AAATCTTTTTA7TTG7TT A7CCCC AAGGCGCG7 



7CCCCGAAAAGTG 

*■ 11310 



G7 AAAGGGGC 7777CAC 



11311 
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TTGTAGAAGGTTGGTTCAGTACTTGGA 



AAAfeCTCTATTSgCTCTTTTTTCTTTATCATAGGCTTAATCATTGCACTATTCTTGCTTCTPr 



AGG 



AACATCTTCCAACCAAGTCATCAACCTTTTCGAGATAACGGAGAAAAA. 



AGAAATAGTATCCCAATTAGTAACCTGATAAGAACCAAG 

leu VC Glu Gly Trp Phe Ser Ser Trp L yS Ser Ser lie *o Ser Phe Phe Fnc He II. Gy Leu I,e He G.y Uu Phe Le u Vol Leu 

G ■ — 



+ 5220 



GA^GGTATCCATCTTTGCATTAAATTAAAGC ACACCAAGAAAAGACAGATTTATACAGACATAGAGATGAACCGACTT 
CTCAACCATAGGTAGAAACGTAATTTAATTTCGTGTGGTTCTT^ 5310 



Arg Vol Gly lie His Leu Cys li e Lys Leu Lys His Thr Lys Lys Ang Gin He Tyr Thr Asp He Glu Het 

' G 



Asn Arg Leu Gly Lys • 



J 



Nhel 



Xmal 
Sma I 



Not I 



TCAAATCCTGCTAGCTATGAAAAAAACTAACAG ATATCCAACCCGGGAGCTAGTTGCGGCC 

agtttaggacgatcgatactttttttgattgtctataggttgggccctcgatcaacgccggcggatcgtctaagaagtacaaacctggtt 5q °° 




Synthetic DNA insert- 



polyfinker- 



G 3' non-coding- 



ATCAACTTGTGATACCATGCTCAAAGAUGCCTCAATT ATATTTGAr z 1 * < TTTAA rT T TTAT> Aa '.AAC TAACAGCAATfATfrftz ArTrr 

tagftgaacactatggtacgagtttctccggagttaa"'' ' -~- - - — ' ' ' " 1 ' h 



TATAAACTCAAAAATTAAAAATACTTTTTTTGATTGTCGTTAGTACCTTC 



AGC 



5490 




G 3' non-coding- 



Het Glu Vol 



ACGATTTTGAGACCGACGAGTTCAATGATTTCAATG AAGA7GACTATGCCACAAGAGAATTCCTGAATCCCGAT GAGCGCATGACGTACT 

tgctaaaactctggctgctcaagttactaaagttacttcIactgatacggtgttctctuaggacttagggctactcgcgtactgcatga 5580 



Asn Pro Asp Glu Arg Het Thr Tyr 



H.s Asp Phe Glu Thr Asp Glu P he Asn' Asp Phe Asn Glu Asp Asp Tyr Alo Thr Arg Glu Phe Leu 

" L 

TGAATC ATGCTCATT AC AATTTGAATTCTCCTCTAATTAGTGATGATATTGACAATTTGATCACGAAATTCAATTr Tr TTrrr^ TTrrrT 



ACT 



Leu Asn h, s Aio Asp Tyr Asn Leu As n Ser Pro Leu lie Ser Asp Asp He Asp Asn Leu He Ang Lys Phe 
' ~ ~ L — ■ - 



AAACTAGTCCTTTAAGTTAAGAGAAGGCTAAGGGA 

Asn Ser Leu Pro He Pro 



f- 5670 



CGATGTGGGATAGTAAGAACTGGGATGGACTTC T TGAGATGTTAACATCAtGTCAACCCAA TCCCATCTCAACATCTCAFJATGr aTaaa 
CCTACACCCTATCATTrTTrArrr-TAr^-rJ.. ' L 1 ' f 4 *~ 1 i 1 



*■ 5760 
TCTACCTATTTA 



GCTACACCCTATCATTCTTGACCCTACCTCAAGAACTCTACAATrGTAGTACAGTTCGGTTAGGGTAGAGTTGTAGAG 

Ser net Trp Asp Ser Lys Asn Trp Asp G.y Vol Leu Glu h et Leu Thr Ser Cys Gin Alo Asn Pro lie Ser Thr Ser Gin Met His Lys 
" " " ~ *L _ 



20/26 



FIG. 2 



BNSDOCID: <WO 



„9634625A1_L> 



WO 96/34625 



PCT/US96/06053 



N 



Sea xp m 



leader RNA 
vRNA 



. ? . UCAGGAGAAAC Pp AACAGUAAUC . f . Ifo 



. ? . AGUCCUCUUUGAAAUUGUCAUUAG . V. 



mRNAs . .?.GCUACAUAUG-poly(A) " P AACAGAUAUC. lS ; 1°) 

o * 

vRNA . . . CGAUGUAUACUUUUUUUGAUUGUCUAUAG . : . 



NS 



mRNAs 
vRNA 



B . NS. P p M 

. . GUAGACUAUG • poly(A) P AACAGAUAUC . ? . 2.1 1 2-2- 
? . CAUCUGAUACUUUUUUUCAUUGUCUAUAG . 



rnRNAs 
vRNA 



A* Pp <S 

. . UAUCCCUAUG • poly(A) P AACAGAGAUC . 2-4* , 2-5 

. . AUAGGGAUACUUUUUUUGAUUGUCUCUAG . ?*. 2.£> 



mRNAs 
vRNA 



. . . AAUUUUUAUG - poly(A) P AACAGCAAUC . ? . 2.? , 2jD 



3* 



. ? . UUAAAAAUACUUUUUUUGAUUGUCGUUAG . 251 



mRNAs 
vRNA 



5' 



. . .UUUAAGUAUG-poIy(A) 



3* 



. AAAUUCAUACUUUUUUUGAAACUAGGA . 



5* 



3o 
31 
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RECOMBINANT VESICULOVIRUSES AN D THEIR USES 

This invention was made with government support 
under grant number R37 AI243245 awarded by the National 
5 Institutes of Health. The government has certain rights in 
the invention, 

1. INTRODUCTION 
The present invention relates to recombinant 
10 vesiculoviruses which are replicable and capable of 

expressing foreign nucleic acid contained in their genome. 
Also provided are inactivated forms of the recombinant 
viruses. The vesiculoviruses are useful in vaccine 
formulations to prevent or treat various diseases and 
15 disorders. 

2. BACKGROUND OF THE INVENTION 
2.1. RHABDOVIRUSES 

Rhabdoviruses are membrane-enveloped viruses that 

2 0 are widely distributed in nature where they infect 

vertebrates, invertebrates, and plants. There are two 
distinct genera within the rhabdoviruses , the Lyssavirus 
genus and the Vesiculovirus genus. Rhabdoviruses have 
single, negative-strand RNA genomes of 11-12,000 nucleotides 

25 (Rose. and Schubert, 1987, Rhabdovirus genomes and their 
products, in The Viimses: The Rhabdoviiruses , Plenum 
Publishing Corp., NY, pp. 129-166). The virus particles 
contain a helical, nucleocapsid core composed of the genomic 
RNA and protein. Generally, three proteins, termed N 

30 (nucleocapsid), P (formerly termed NS, originally indicating 
nonstructural), and L (large) are found to be associated with 
the nucleocapsid. An additional matrix (M) protein lies 
within the membrane envelope, perhaps interacting both with 
the membrane and the nucleocapsid core. A single 

35 glycoprotein (G) species spans the membrane and forms the 
spikes on the surface of the virus particle. Because the 
genome is the negative sense [i.e., complementary to the RNA 
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sequence (positive sense) that functions as inRNA to directly 
produce encoded protein] , rhabdoviruses must encode and 
package an RNA-dependent RNA polymerase in the virion 
(Baltimore et al., 1970, Proc. Natl. Acad. Sci. USA 66: 
5 572-57 6) , composed of the P and L proteins. This enzyme 

transcribes genomic RNA to make subgenomic MRNAS encoding the 
5-6 viral proteins and also replicates full-length positive 
and negative sense RNAs. The genes are transcribed 
sequentially, starting at the 3' end of the genomes. The 
10 same basic genetic system is also employed by the 
paramyxoviruses and filoviruses. 

The prototype rhabdovirus, vesicular stomatitis 
. virus (VSV) , grows to very high titers in most animal cells 

and can be prepared in large quantities. As a result, VSV 
15 has been widely used as a model system for studying the 

replication and assembly of enveloped RNA viruses. The study 
of VSV and related negative strand viruses has been limited 
by the inability to perform direct genetic manipulation of 
. .;■ virus using recombinant DNA technology. The difficulty 
2 0 in generating VSV from DNA is that neither the full-length 
genomic nor antigenomic RNAs are infectious. The minimal 
infectious unit is the genomic RNA tightly bound to 1,250 
subunits of the nucleocapsid (N) protein (Thomas et al. , 
1985, J. Virol. 54:598-607) and smaller amounts of the two 
25 virally encoded polymerase subunits, L and P. To 

reconstitute infectious virus from the viral RNA, it is 
necessary first to assemble the N protein-RNA complex that 
serves as the template for transcription and replication by 
the VSV polymerase. Although smaller negative-strand RNA 
30 segments of the influenza virus genome can be packaged into 
nucleocapsids in vitro, and then rescued in influenza 
infected cells (Enami et al., 1990, Proc. Natl. Acad. Sci. 
USA 87:3802-3805; Luytjes et al., 1989, Cell 59:1107-1113), 
systems for packaging the much larger rhabdoviral genomic 
35 RNAs in vitro are not yet available. 

Recently, systems for replication and transcription 
of DNA-derived minigenomes or small defective RNAs from 
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rhabdoviruses (Conzelmann and Schnell, 1994, J. Virol. 
68:713-719; Pattnaik et al. # 1992, Cell 69:1011-1120) and 
paramyxoviruses (Calain et al., 1992, Virology 191:62-71; 
Collins et al., 1991, Proc. Natl- Acad. Sci. USA 
5 88:9663-9667; Collins et al., 1993, Virology 195:252-256; De 
and Banerjee, 1993, Virology 196:344-348; Dimock and Collins, 
1993, J. Virol. 67:2772-2778; Park et al., 1991, Proc. Natl. 
Acad. Sci. USA 88:5537-5541) have been described. In these 
systems, RNAs are assembled into nucleocapsids within cells 

10 that express the viral N protein and polymerase proteins. 
Although these systems have been very useful, they do not 
allow genetic manipulation of the full-length genome of 
infectious viruses. 

The recovery of rabies virus from a complete cDNA 

15 clone was published recently (Schnell et al., 1994, EMBO J. 
13:4195-4203). The infectious cycle was initiated by 
expressing the antigenomic (full-length positive strand) RNA 
in cells expressing the viral N, P, and L proteins. Although 
- rabies virus is a rhabdcvirusv- it is structurally and 

20 functionally different from the vesiculoviruses. Rabies 
virus is a Lyssavirus, not a Vesiculovirus. Lyssaviruses 
invade the central nervous system. Vesiculoviruses invade 
epithelial cells, predominantly those of the tongue, to 
produce vesicles. Rabies virus causes encephalitis in a 

25 variety of animals and in humans, while VSV causes an 
epidemic but self -limiting disease in cattle. In sharp 
contrast to VSV-infected cells, rabies virus produces little 
or no cytopathic effect in infected cell culture, replicates 
less efficiently than VSV in cell culture, and causes little 

30 depression of cellular DNA, RNA or protein synthesis in 

infected cell cultures (see Baer et al., 1990, in Virology, 
2d ed., Fields et al. (eds.), Raven Press, Ltd., NY, pp. 883, 
887) . Indeed, there is no cross-hybridization observed 
between the genomes of rabies virus and VSV, and sequence 

35 homology between the two genomes is generally discernable 
only with the aid of computer run homology programs. The 
differences between vesiculoviruses and rabies virus, and the 
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extremely rare nature of rabies virus recovery from cDNA (-10 
cells are transfected to yield one infectious cell) , renders 
it unpredictable whether the strategy used with rabies virus 
would be successful for viruses of a different genus, i.e., 
5 the vesiculoviruses. 

The recovery of infectious measles virus, another 
negative strand RNA virus, from cloned cDNA has been 
attempted, without success (see Ballart et al., 1990, EMBO J. 
9(2):379-384 and the retraction thereof by Eschle et al., 
10 1991, EMBO J. 10(11) :3558) . 

2.2. VACCINES 
The development of vaccines for the prevention of 
viral, bacterial, or parasitic diseases is the focus of much 

15 research effort. 

Traditional ways of preparing vaccines include the 
use of inactivated or attenuated pathogens. A suitable 
inactivation of the pathogenic microorganism renders it 
hrr=;less as a biological agent but does not destroy its 

2 0 immunogenicity. Injection of these "killed" particles into a 
host will then elicit an immune response capable of 
preventing a future infection with a live microorganism. 
However, a major concern in the use of killed vaccines (using 
inactivated pathogen) is failure to inactivate all the 

25 microorganism particles. Even when this is accomplished, 
since killed pathogens do not multiply in their host, or for 
other unknown reasons, the immunity achieved is often 
incomplete, short lived and requires multiple immunizations. 
Finally, the inactivation process may alter the 

30 microorganism's antigens, rendering them less effective as 
immunogens . 

Attenuation refers to the production of strains of 
pathogenic microorganisms which have essentially lost their 
disease-producing ability. One way to accomplish this is to 
35 subject the microorganism to unusual growth conditions and/or 
frequent passage in cell culture. Mutants are then selected 
which have lost virulence but yet are capable of eliciting an 
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immune response. Attenuated pathogens often make good 
immunogens as they actually replicate in the host cell and 
elicit long lasting immunity. However, several problems are 
encountered with the use of live vaccines, the most worrisome 
5 being insufficient attenuation and the risk of reversion to 
virulences 

An alternative to the above methods is the use of 
subunit vaccines. This involves immunization only with those 
components which contain the relevant immunological material. 
10 Vaccines are often formulated and inoculated with 

various adjuvants. The adjuvants aid in attaining a more 
durable and higher level of immunity using small amounts of 
antigen or fewer doses than if the immunogen were 
administered alone. The mechanism of adjuvant action is 

i 

15 complex and not completely understood. However, it may 

involve the stimulation of cytokine production, phagocytosis 
and other activities of the reticuloendothelial system as 
well as a delayed release and degradation of the antigen. 
Examples of adjuvants include "F^rySr» i , .s' adjuvant (complete or 

20 incomplete), Adjuvant 65 (containing peanut oil, mannide 
monooleate and aluminum monostearate) , the pluronic polyol 
L-121, Avridine, and mineral gels such as aluminum hydroxide, 
aluminum phosphate, etc. Freund's adjuvant is no longer used 
in vaccine formulations for humans because it contains 

25 nonmetabolizable mineral oil and is a potential carcinogen. 

3 . SUMMARY OF THE INVENTION 
The present invention provides recombinant 
replicable vesiculoviruses. The prior art has unsuccessfully 

30 attempted to produce replicable vesiculoviruses from cloned 
DNA. In contrast, the invention provides a method which, for 
the first time, has successfully allowed the production and 
recovery of replicable vesiculoviruses, as well as 
recombinant replicable vesiculoviruses, from cloned DNA. 

35 The vesiculoviruses of the invention are produced 

by providing in an appropriate host cell: (a) DNA that can 
be transcribed to yield (encodes) vesiculovirus antigenomic 
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(+) RNA (complementary to the vesiculovirus genome), (b) a 
recombinant source of vesiculovirus N protein, (c) a 
recombinant source of vesiculovirus P protein, and (d) a 
recombinant source of vesiculovirus L protein; under 
5 conditions such that the DNA is transcribed to produce the 
antigenomic RNA, and a vesiculovirus is produced that 
contains genomic. RNA complementary to the antigenomic RNA 
produced from the DNA. 

The invention provides an infectious recombinant 
10 vesiculovirus capable of replication in an animal into which 
the recombinant vesiculovirus is introduced, in which the 
genome of the vesiculovirus comprises foreign RNA which is 
not naturally a part of the vesiculovirus genome. The 
recombinant vesiculovirus is formed by producing 
15 vesiculoviruses according to the method of the invention, in 
which regions of, the DNA encoding vesiculovirus antigenomic 
(+) RNA that are nonessential for viral replication have been 
inserted into or replaced with foreign DNA. 

•• In a preferred embodiment, the foreign RNA 
20 contained within the genome of the recombinant vesiculovirus 
(originally encoded by the foreign DNA) , upon expression in 
an appropriate host cell, produces a protein or peptide that 
is antigenic or immunogenic. 

The recombinant vesiculoviruses of the invention 
25 have use as vaccines. In one embodiment, where the foreign 
RNA directs production of an antigen that induces an immune 
response against a pathogen, the vaccines of the invention 
have use in the treatment or prevention of infections by such 
a pathogen (particularly a pathogenic microorganism) , and its 
30 clinical manifestations, i.e., infectious disease. In a 
preferred embodiment, such an antigen displays the 
antigenicity or immunogenicity of an envelope glycoprotein of 
a virus other than a vesiculovirus, and the antigen is 
incorporated into the vesiculovirus envelope. The 
35 recombinant vesiculoviruses also have uses in diagnosis, and 
monitoring progression of infectious disorders, including 
response to vaccination and/or therapy. 
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In another embodiment, where the foreign RNA 
directs production of an antigen that induces an immune 
response against a tumor, the recombinant viruses of the 
invention have uses in cancer immunoprophylaxis, 
5 immunotherapy, and diagnosis, and monitoring of tumor 
progression or regression. 

The recombinant vesiculoviruses can be used as live 
vaccines, or can be inactivated for use as killed vaccines. 
The recombinant viruses can also be used to produce large 
10 quantities of readily purified antigen, e.g., for use in 
subunit vaccines. 

The invention also provides vaccine formulations, 
kits, and recombinant host cells. 

15 -4. DESCRIPTION OF THE FIGURES 

Figure 1. Nucleotide sequence of plasmid 
pVSVFL(+), showing the complete DNA sequence that is 
transcribed to produce VSV antigenomic (+) RNA, and predicted 
ZBqneiicz3 of the encoded VSV protains^. .jn protein: 
20 SEQ ID NO:2; P protein: SEQ ID NO: 3; M protein: SEQ ID 
NO: 4; G protein: SEQ ID NO: 5; L protein: SEQ ID NO: 6] The 
noncoding and intergenic regions are observable. The upper 
line of sequence (SEQ ID N0:l) is the VSV antigenomic 
positive strand; lower line = SEQ ID NO: 7. Restriction sites 
25 are indicated. The transmembrane and cytoplasmic domains of 
the G protein are also indicated. The sequences of the first 
T7 RNA polymerase promoter (SEQ ID NO: 8), the second T7 RNA 
polymerase promoter (SEQ ID NO: 9); leader sequence (SEQ ID 
NO: 10), T7 RNA polymerase transcription termination signal 
30 (SEQ ID NO: 11), and the sequence that is transcribed to 
produce the HDV ribozyme (SEQ ID NO: 12) are shown. 

Figure 2. Nucleotide sequence of a portion of 
plasmid pVSVSSl, showing the synthetic DNA insert containing 
the polylinker region inserted between the G and L coding 
35 regions (3' of G and 5' of L) containing unique restriction 
enzyme recognition sites, namely, Xmal, NotI, and Smal. 
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Upper line of sequence: SEQ ID NO: 13; lower line of 
sequence: SEQ ID NO: 14. 

Figure 3. Gene junctions of VSV. The nucleotide 
sequences at the 3 ' end of the leader RNA and the 5' and 3' 
5 ends of each mRNA are shown along with the corresponding 
genomic sequences (vRNA) (SEQ ID N0M5-31). The intergenic 
dinucleotides are indicated by bold letters. From Rose and 
Schubert, 1987, in The Vijruses: The Rhabdoviruses , Plenum 
Press, NY, pp. 129-166, at p. 136. 
" Figure 4. Plasmid DNA construction. A. The 

diagram illustrates the cloned VSV genomic sequence and the 
four DNA fragments (numbered 1-4) that were used to generate 
the plasmid pVSVFL(+) . The horizontal arrows represent PCR 
primers used to generate fragments 1 and 3. B. Diagram of 
15 the plasmid pVSVFL( + ) that gives rise to infectious VSV. 

The locations of the VSV genes encoding the five proteins N, 
P, M, G, and L are shown. The stippled region from Sac I to 
Xho I represents the pBSSK + vector sequence, and the hatched 
se^itr.^^ ^present the regions of the VSV genome generated by 
2 0 PCR. Transcription from the T7 promoter generates the 
complete (+) strand VSV RNA. 

Figure 5. Proteins present in wild-type and 
recombinant VSVs. Proteins from 1% of the virus recovered 
from approximately 5 x 10 6 infected BHK cells were separated 
25 by SDS-PAGE (10% acrylamide) and visualized by staining with 
Coomassie brilliant blue. Positions of the five VSV proteins 
are indicated. 

Figure 6. Identification of a restriction enzyme 
recognition sequence in the recombinant VSV. A 62 0 

30 nucleotide segment of genomic RNA isolated from wildtype and 
recombinant VSV was amplified by reverse transcription and 
PCR using the primers 5 ' -CATTCAAGACGCTGCTTCGCAACTTCC 
(SEQ ID NO:32) and 5 ' -CATGAATGTTAACATCTCAAGA (SEQ ID NO:33). 
Controls in which reverse transcriptase was omitted from the 

35 reaction are indicated. DNA samples were either digested 
with Nhe I or left undigested prior to electrophoresis on a 
6% polyacrylamide gel as indicated. DNA was detected by 
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staining with ethidium bromide. Sizes of DNA markers are 
indicated on the left. 

Figure 7. Autoradiogram showing the sequence of 
genomic RNA from recombinant VSV. RNA prepared from 
5 recombinant VSV was sequenced by the dideoxy method using 
reverse transcriptase. The written sequence corresponds to 
nucleotides 1563-1593 in the G mRNA (Rose and Gallione, 1981, 
J. Virol. 39:519-528). The underlined sequence represents 
the four nucleotides that were changed to generate the Nhe 1 
10 site. 

Figure 8. Protein analysis of recombinant VSV 
expressing the glycoprotein from the New Jersey serotype. 
Proteins from 1% of the virus pelleted from the medium of 
approximately 5 x 10 6 BHK cells infected for 24 hours with 
15 wildtype VSV, (lane 1), recombinant VSV^ (lane 2) or wildtype 
VSV^lane 3) were separated by SDS-PAGE (10% acrylamide) . 
The proteins were visualized by staining with Coomassie 
brilliant blue. Positions of viral proteins are indicated. 

20 . 5 - DETAILED DESCRIPT ION OF THE INVENTION 

The present invention provides recombinant 
replicable vesiculoviruses. The prior art has unsuccessfully 
attempted to produce replicable vesiculoviruses from cloned 
DNA. In contrast, the invention provides a method which, for 

25 the first time, has successfully allowed the production and 
recovery of replicable vesiculoviruses, as well as 
recombinant replicable vesiculoviruses, from cloned DNA. 
Expression of the full-length positive-strand vesiculovirus 
RNA in host cells has successfully allowed the generation of 

30 recombinant vesiculoviruses from DNA, providing recombinant 
viruses that do not cause serious pathology in humans and 
that can be obtained in high titers, that have use as 
vaccines. 

The vesiculoviruses of the invention are produced 
35 by providing in an appropriate host cell: (a) DNA that can 
be transcribed to yield (encodes) vesiculovirus antigenomic 
(+) RNA (complementary to the vesiculovirus genome) , (b) a 
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recombinant source of vesiculovirus N protein, (c) a 
recombinant source of vesiculovirus P protein, and (d) a 
recombinant source of vesiculovirus L protein; under 
conditions such that the DNA is transcribed to produce the 
5 antigenomic ,RNA, and a vesiculovirus is produced that 
contains genomic RNA complementary to the antigenomic RNA 
produced from the DNA. 

The invention provides an infectious recombinant 
vesiculovirus capable of replication in an animal into which 
10 the recombinant vesiculovirus is introduced, in which the 
genome of the vesiculovirus comprises foreign RNA which is 
not naturally a part of the vesiculovirus genome. The 
recombinant vesiculovirus is formed by producing 
vesiculoviruses according to the method of the invention, in 
15 which regions of the DNA encoding vesiculovirus antigenomic 
(+) RNA that are nonessential for viral replication have been 
inserted into or replaced with foreign DNA. 

Since the viruses are replicable (i.e., not 
replic- -innwfef active) , they encode all the vesiculovirus 
20 machinery necessary for replication in a cell upon infection 
by the virus. 

In a preferred embodiment, the recombinant 

vesiculovirus is a recombinant vesicular stomatitis virus 
(VSV) . 

25 Ir > another preferred embodiment, the foreign RNA 

contained within the genome of the recombinant vesiculovirus 
(originally encoded by the foreign DNA) , upon expression in 
an appropriate host cell, produces a protein or peptide that 
is antigenic or immunogenic. Such an antigenic or 

30 immunogenic protein or peptide whose expression is directed 
by the foreign RNA (present in the negative sense) within the 
vesiculovirus genome (by expression from the (+) antigenomic 
message) shall be referred to hereinafter as the "Antigen. ,f 
Appropriate Antigens include but are not limited to known 

35 antigens of pathogenic microorganisms or of tumors, as well 
as fragments or derivatives of such antigens displaying the 
antigenicity or immunogenicity of such antigens. A protein 
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displays the antigenicity of an antigen when the protein is 
capable of being immunospecif ically bound by an antibody to 
the antigen. A protein displays the immunogenicity of an 
antigen when it elicits an immune response to the antigen 
5 (e.g., when immunization with the protein elicits production 
of an antibody that immunospecif ically binds the antigen or 
elicits a cell-mediated immune response directed against the 
antigen) . 

The recombinant vesiculoviruses of the invention 

10 have use as vaccines. In one embodiment, where the foreign 
RNA directs production of an Antigen (originally encoded by 
the foreign DNA used to produce the recombinant vesiculovirus 
or its predecessor) that induces an immune response against a 
pathogen, the vaccines of the invention have use in the 

15 treatment or prevention of infections by such a pathogen 
(particularly a pathogenic microorganism), and its clinical 
manifestations, i.e., infectious disease. The invention thus 
provides methods of prevention or treatment of infection and 
infectious ^disease comprising administ ^ring^to subject in 

20 which such treatment or prevention is desired one or more of 
the recombinant vesiculoviruses of the invention. The 
recombinant vesiculoviruses also have uses in diagnosis, and 
monitoring progression of infectious disorders, including 
response to vaccination and/ or therapy. 

25 In another embodiment, where the Antigen induces an 

immune response against a tumor, the recombinant viruses of 
the invention have uses in cancer immunoprophylaxis, 
immunotherapy, and diagnosis, and monitoring of tumor 
progression or regression. 

30 The recombinant vesiculoviruses can be used as live 

vaccines, or can be inactivated for use as killed vaccines. 
The recombinant viruses can also be used to produce large 
quantities of readily purified antigen, e.g., for use in 
subunit vaccines. 

35 In a specific embodiment, the foreign DNA used 

initially for production of the recombinant vesiculoviruses 
can also comprise a sequence encoding a detectable marker, 
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e.g., /?-galactosidase, /3-glucuronidase, £-geo (Friedrich & 
Soriano, 1991, Genes Dev. 5:1513-1523). 

In another specific embodiment, the foreign DNA can 
also comprise a sequence encoding a cytokine capable of 
5 stimulating an immune response. Such cytokines include but 
are not limited to, inter leukin-2 , inter leukin-6, 
interleukin-12 , interferons, and granulocyte-macrophage 
colony stimulating factors. 

In a preferred aspect, upon infection with a 

10 recombinant vesiculovirus of the invention, the Antigen is 
expressed as a nonfusion protein. In a less preferred 
embodiment, the Antigen is expressed as a fusion protein, 
e.gr., to the viral G protein. "Fusion protein," as used 
herein, refers to a protein comprising an amino acid sequence 

15 from a first protein covalently linked via a peptide bond at 
its carboxy terminus to the amino terminus of an amino acid 
sequence from a second, different protein. 

In one embodiment, a vaccine formulation of the 
invention icon tain 5? a single type of recombinant vesiculovirus 

2 0 of the invention. In another embodiment, a vaccine 

formulation comprises a mixture of two or more recombinant 
viruses of the invention. 

The vaccine formulations of the invention provide 
one or more of the following benefits: stability for long 

25 periods without refrigeration; ease of production; low cost 
and high titer of production; ability to be administered by 
local workers without advanced medical training; and 
involving administration of a microorganism that is known not 
to cause serious disease in humans. 

30 The present invention also provides a host cell 

infected with a recombinant vesiculovirus capable of 
replication. In one embodiment, the host cell is a mammalian 
cell. Preferably, the mammalian cell is a hamster kidney 
cell. 

35 
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5.1. DNA THAT CAN BE TRANSCRIBED TO PRODUCE 
VESICULOVIRUS ANT I GENOMIC (+) RNA 

Many vesiculoviruses are known in the art and can 
be made recombinant according to the methods of the 
r invention. Examples of such vesiculoviruses are listed in 
Table I. 



TABLE I 

MEMBERS OF THE VESICULOVIRUS GENUS 

10 

Virus Source of virus in nature 



VSV-New Jersey Mammals, mosquitoes, midges, 

blackflies, houseflies 





VSV-Indiana 


Mammals, mosquitoes, 


sandflies 


15 


Alagoas 


Mammals, sandflies 






Cocal 


Mammals, mosquitoes, 


mites 




Jurona 


Mosquitoes 






Carajas 


Sandflies 




20 










Maraba 


Sandflies 






Piry 


Mammals 






Calchagui 


Mosquitoes 




25 


Yug Bogdanovac 


Sandflies 






Isfahan 


Sandflies, ticks 






Chandipura 


Mammals, sandflies 






Perinct 


Mosquitoes , sandf liej 




30 


Porton-S 


Mosquitoes 





Any DNA that can be transcribed to produce 
vesiculovirus antigenomic ( + ) RNA (complementary to the VSV 
genome) can be used for the construction of a recombinant DNA 
containing foreign DNA encoding an Antigen, for use in 
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producing the recombinant vesiculoviruses of the invention. 
DNA that can be transcribed to produce vesiculovirus 
antigenomic ( + ) RNA (such DNA being referred to herein as 
"vesiculovirus (-) DNA") is available in the art and/or can 
5 be obtained by standard methods. In particular, plasmid 
pVSVFL( + ) , containing VSV (-) DNA that is preferred for use 
in the present invention, has been deposited with the ATCC 
and assigned accession no. 97134. In a preferred aspect, DNA 
that can be transcribed to produce VSV ( + ) RNA, [i.e., 

10 VSV (-) DNA], is used. VSV (-) DNA for any serotype or 
strain known in the art, e.g., the New Jersey or Indiana 
serotypes of VSV, can be used. The complete nucleotide and 
deduced protein sequence of the VSV genome is known, and is 
available as Genbank VSVCG, Accession No. J02428 ; NCBI Seq ID 

15 335873; and is published in Rose and Schubert, 1987, in The 
Viruses; The Rhabdoviruses , Plenum Press, NY, pp. 129-166. 
Partial sequences of other vesiculovirus genomes have been 
published and are available in the art. The complete 
sequence cf the ■■•■VJV(-) DNA that is used ir. ?~ T pFef>rred 

2 0 embodiment is contained in plasmid pVSVFL( + ) and is shown in 
Figure 1; also shown are with the predicted sequences of the 
VSV proteins (this sequence contains several sequence 
corrections relative to that obtainable from Genbank) . 
Vesiculovirus (-) DNA, if not already available, can be 

25 prepared by standard methods, as follows: If vesiculoviral 
cDNA is not already available, vesiculovirus genomic RNA can 
be purified from virus preparations, and reverse 
transcription with long distance polymerase chain reaction 
used to generate the vesiculovirus (-) DNA. Alternatively, 

30 after purification of genomic RNA, VSV mRNA can be 

synthesized in vitro, and cDNA prepared by standard methods, 
followed by insertion into cloning vectors (see, e.g., Rose 
and Gallione, 1981, J. Virol . 39 (2) : 519-528) . Individual 
cDNA clones of vesiculovirus RNA can be joined by use of 

35 small DNA fragments covering the gene junctions, generated by 
use of reverse transcription and polymerase chain reaction 
(RT-PCR) (Mullis. and Faloona, 1987, Meth. Enzymol . 
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155:335-350) from VSV genomic RNA (see Section 6, infra). 
Vesiculoviruses are available in the art. For example, VSV 
- can be obtained from the American Type Culture Collection. 

In a preferred embodiment, one or more, preferably 
5 unique, restriction sites (e.g., in a polylinker) are 
introduced into the vesiculovirus (-) DNA, in intergenic 
regions, or 5' of the sequence complementary to the 3' end of 
the vesiculovirus genome, or 3' of the sequence complementary 
to the 5' end of the vesiculovirus genome, to facilitate 
10 insertion of the foreign DNA. 

In a preferred method of the invention, the 
vesiculovirus (-) DNA is constructed so as to have a promoter 

operatively linked thereto. The promoter should be capable 

f 

of initiating transcription of the (-) DNA in an animal or 

15 insect cell in which it is desired to produce the recombinant 
vesiculovirus. Promoters which may be used include, but are 
not limited to, the SV4 0 early promoter region (Bernoist and 
Chambon, 1981, Nature 290:304-310), the promoter contained in 
the 3 V long terminal repeat of Rous sarccsva virus (Yamamoto, 

20 et al., 1980, Cell 22:787-797), the herpes thymidine kinase 
promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 
78:1441-1445), the regulatory sequences of the 
metallothionein gene (Brinster et al., 1982, Nature 
296:39-42); heat shock promoters (e.g., hsp70 for use in 

25 Drosophila S2 cells); the ADC (alcohol dehydrogenase) 

promoter, PGK (phosphoglycerol kinase) promoter, alkaline 
phosphatase promoter, and the following animal 
transcriptional control regions, which exhibit tissue 
specificity and have been utilized in transgenic animals: 

30 elastase I gene control region which is active in pancreatic 
acinar cells (Swift et al., 1984, Cell 38:639-646; Ornitz et 
al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control 
region which is active in pancreatic beta cells (Hanahan, 

35 1985, Nature 315:115-122), immunoglobulin gene control region 
which is active in lymphoid cells (Grosschedl et al., 1984, 
Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; 
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r 



Alexander et al., 1987, Mol. Cell. Biol. 7:143 6-1444), mouse 
mammary tumor virus control region which is active in 
testicular, breast, lymphoid and mast cells (Leder et al., 
1986, Cell 45:485-495), albumin gene control region which is 
5 active in liver (Pinkert et al . , 1987, Genes and Devel. 
1:268-276), alpha-f etoprotein gene control region which is 
active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 
5:1639-1648; Hammer et al., 1987, Science 235:53-58; alpha 1- 
antitrypsin gene control region which is active in the liver 

10 (Kelsey et al. , 1987, Genes and Devel. 1:161-171), beta- 
globin gene control region which is active in myeloid cells 
(Mogram et al. r 1985, Nature 315:338-340; Kollias et al., 
1986, Cell 46:89-94; myelin basic protein gene control region 
which is active in oligodendrocyte cells in the brain 

15 (Readhead et al., 1987, Cell 48:703-712); and myosin light 
chain-2 gene control region which is active in skeletal 
muscle (Sani, 1985, Nature 314:283-286). Preferably, the 
promoter is an RNA polymerase promoter, preferably a 
bacterioj-^ge t»r or insect RNA polymer*/,** Dromorar, 

20 including but not limited to the promoters for T7 RNA 

polymerase, SP6 RNA polymerase, and T3 RNA polymerase. If an 
RNA polymerase promoter is used in which the RNA polymerase 
is not endogenously produced by the host cell in which it is 
desired to produce the recombinant vesiculovirus, a 

25 recombinant source of the RNA polymerase must also be 
provided in the host cell. 

The vesiculovirus (-) DNA can be operably linked to 
a promoter before or after insertion of foreign DNA encoding 
an Antigen. Preferably, a transcriptional terminator is 

30 situated downstream of the vesiculovirus (-) DNA. 

In another preferred embodiment, a DNA sequence 
that can be transcribed to produce a ribozyme sequence is 
situated at the immediate 3' end of the vesiculovirus 
DNA, prior to the transcriptional termination signal, so that 

35 upon transcription a self-cleaving ribozyme sequence is 

produced at the 3' end of the antigenomic RNA, which ribozyme 
sequence will autolytically cleave (after a U) this fusion 
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transcript to release the exact 3' end of the vesiculovirus 
antigenomic ( + ) RNA. Any ribozyme sequence known in the art 
may be used, as long as the correct sequence is recognized 
and cleaved. (It is noted that hammerhead ribozyme is 
5 probably not suitable for usei) In a preferred aspect, 
hepatitis delta virus (HDV) ribozyme is used (Perrotta and 
Been, 1991, Nature 350:434-436; Pattnaik et al., 1992, Cell 
69:1011-1020) . 

A preferred VSV(-) DNA for use, for insertion of 

10 foreign DNA, is that shown in Fig. 1 and contained in plasmid 
pVSVFL(+) , in which a T7 RNA polymerase promoter is present 
5' of the sequence complementary to the 3' end of the VSV 
genome. Plasmid pVSVFL(+) thus comprises (in 5' to 3' order) 
the following operably linked components: the T7 RNA 

15 polymerase promoter, VSV (-) DNA, a DNA sequence that is 

transcribed to produce an HDV ribozyme sequence (immediately 
downstream of the VSV (-) DNA) , and a T7 RNA polymerase 
transcription termination site. A plasmid that can also be 
- made and used is plasmid pVSVSSl, a portion' of the ^sc^uence 

2 0 of which is shown in Fig. 2, in which a synthetic DNA 

poly linker, facilitating insertion of foreign DNA, has been 
inserted into pVSVFL(+) between the G and L coding regions. 
The polylinker was synthesized on a DNA synthesizer so as to 
have ends compatible for ligation into an Nhel site, and to 

25 contain the unique restriction enzyme recognition sites Xmal, 
Smal, and NotI, facilitating insertion of foreign DNA 
generated by cleavage with one of these enzymes or ligated to 
a linker containing a recognition site for one of these 
enzymes (which is then subjected to cleavage prior to 

30 insertion) . 

The foreign DNA encoding an Antigen is inserted 
into any region, or replaces any region, of the 
vesiculovirus (-) DNA that is not essential for vesiculovirus 
replication. In a preferred embodiment, the foreign DNA is 

35 thus inserted into an intergenic region, or a portion of the 
vesiculovirus (-) DNA that is transcribed to form the 
noncoding region of a viral mRNA. In a preferred embodiment, 
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the invention provides a nucleic acid comprising the DNA 
sequence of plasmid pVSVFL(+) as depicted in Figure 1 from 
nucleotide numbers 623-12088 (a portion of SEQ ID N0:1) , in 
which a region nonessential for vesiculovirus replication has 
5 been inserted into or replaced by foreign DNA. 

Vesiculoviruses have a defined intergenic 
structure. Extensive homologies are found around the 
intergenic dinucleotides (Fig. 3). These regions have the 
common structure ( 3 ' ) AUACUUUUUUUNAUUGUCNNUAG ( 5 ' ) 

10 (SEQ ID NO:34), in which N indicates any nucleotide (thus 
three variable positions are present) and the intergenic 
dinucleotide is underlined. These dinucleotide spacers are 
GA, except at the NS-M junction, where the dinucleotide is 
CA. The first 11 nucleotides of the common sequence are 

15 complementary to the. sequence (5') . . . UAUGAAAAAAA . . . 
(3') (SEQ ID NO: 35) that occurs at the 

mRNA-polyadenylatetpoly(A)] junction in each mRNA including 
L. Reiterative copying of the U residues by the VSV 
polymerase pyaswily ^en-rates the poly (A) tail on each m^NA 

20 (McGeoch, 1979, Cell 17:3199; Rose, 1980, Cell 19:415; 
Schubert et al., 1980, J. Virol. 34:550). The sequence 
complementary to the 5' end of the mRNA follows the 
intergenic dinucleotide. The L mRNA also terminates with the 
sequence UAUG-poly(A) encoded by the sequence ( 3 ' ) AUACUUUUUUU 

25 (SEQ ID NO: 36) and is presumably also polyadenylated by a 
polymerase "slippage" mechanism (Schubert et al. , 1980, J. 
Virol. 34:550; Schubert and Lazarini, 1981, J. Virol. 
38:256). 

Thus, intergenic regions in vesiculovirus (-) DNA 
30 consist of three parts, triggering transcriptional 

termination and reinitiation present both 5' and 3' to each 
gene (presented as the 5' to 3' sequence of the positive 
sense strand of vesiculovirus (-) DNA) : (a) TATGAAAAAAA 
(SEQ ID NO:37), followed by (b) the dinucleotide GT or CT, 
35 followed by (c) AACAG. Therefore, in a preferred aspect, 

foreign DNA encoding an Antigen can readily be expressed as a 
nonfusion protein from intergenic regions, simply by ensuring 
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that this three-part intergenic region is reconstituted — 
i.e., that this intergenic region appears 5' and 3' to the 
foreign DNA and also 5' and 3' to the adjacent genes. For 
example, in a preferred embodiment, DNA consisting of (a) 
5 this three-part intergenic region, fused to (b) foreign DNA 
coding for a desired Antigen (preferably including the 
Antigen gene's native start and stop codons for initiation), 
is inserted into a portion of the vesiculovirus (-) DNA that 
is transcribed to form the 3' noncoding region of any 
10 vesiculovirus mRNA. In a particularly preferred aspect, the 
foreign DNA is inserted in the noncoding region between G and 
L. 

In an alternative embodiment, the foreign DNA can 
be inserted into the G gene, so as to encode a fusion protein 
15 with G, for resultant surface display of the Antigen on the 
vesiculovirus particle. Selection should be undertaken to 
ensure that the foreign DNA insertion does not disrupt G 
protein function. 

- __ In a pref erre d embodiment, an Antigen. [expressed: by 

20 a recombinant vesiculovirus is all or a portion of an 

envelope glycoprotein of a virus other than a vesiculovirus. 
Such an Antigen can replace the endogenous vesiculovirus G 
protein in the vesiculovirus, or can be expressed as a fusion 
with the endogenous G protein, or can be expressed in 

25 addition to the endogenous G protein either as a fusion or 
nonfusion protein. In a specific embodiment, such an Antigen 
forms a part of the vesiculovirus envelope and thus is 
surface-displayed in the vesiculovirus particle. By way of 
example, gpl60 or a fragment thereof of Human 

30 Immunodeficiency Virus can be the Antigen, which is cleaved 
to produce gpl2 0 and gp41 (see Owens and Rose, 1993, J. 
Virol. 67(1) :360-365) . In a specific embodiment, the G gene 
of VSV in the VSV (-) DNA of plasmid pVSVFL(+) can be easily 
excised and replaced, by cleavage at the Nhel and Mlul sites 

35 flanking the G gene and insertion of the desired sequence. 
In another specific embodiment, the Antigen is a foreign 
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envelope glycoprotein or portion thereof that is expressed as 
a fusion protein comprising the cytoplasmic domain (and, 
optionally, also the transmembrane region) of the native 
vesiculovirus G protein (see Owens and Rose, 1993, J. Virol. 
5 67(1) :360-365) . Such a fusion protein can replace or be 
expressed in addition to the endogenous vesiculovirus G 
protein. As shown by way of example in Section 6 below, the 
entire native G coding sequence can be replaced by a coding 
sequence of a different G to produce recombinant replicable 
10 vesiculoviruses that express a non-native glycoprotein. 
While recombinant vesiculoviruses that express and display 
epitope (s) of envelope glycoproteins of other viruses can be 
used as live vaccines, such vesiculoviruses also are 
particularly useful as killed vaccines, as well as in the 
15 production of subunit vaccines containing the vesiculovirus- 
produced protein comprising such epitope(s). 

In a specific embodiment, a recombinant 
vesiculovirus of the invention expresses in a host to which 
it is administered cm or wcr* Antigens. In one embodiment, 
20a multiplicity of Antigens are expressed, each displaying 
different antigenicity or immunogenicity . 

5 *2. DNA SEQU ENCES ENCODING ANTIGENS 

The invention provides recombinant vesiculoviruses 
25 capable of replication that have a foreign RNA sequence 

inserted into or replacing a site of the genome nonessential 
for replication, wherein the foreign RNA sequence (which is 
in the negative sense) directs the production of an Antigen 
capable of being expressed in a host infected by the 
30 recombinant virus. This recombinant genome is originally 
produced by insertion of foreign DNA encoding the Antigen 
into the vesiculovirus (-) DNA. Any DNA sequence which 
encodes an immunogenic (capable of provoking an immune 
response) Antigen, which produces prophylactic or therapeutic 
35 immunity against a disease or disorder, when expressed as a 
fusion or, preferably, nonfusion protein in a recombinant 
vesiculovirus of the invention, alone or in combination with 
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other Antigens expressed by the same or a different 
vesiculovirus recombinant, can be isolated for use in the 
vaccine formulations of the present invention. 

In a preferred embodiment, expression of an Antigen 
5 by a recombinant vesiculovirus induces an immune response 
against a pathogenic microorganism. For example, an Antigen 
may display the immunogenicity or antigenicity of an antigen 
found on bacteria, parasites, viruses, or fungi which are 
causative agents of diseases or disorders. In a preferred 
10 embodiment, Antigens displaying the antigenicity or 

immunogenicity of antigens of animal viruses of veterinary 
importance (for example, which cause diseases or disorders in 
non-human animals such as domestic or farm animals, e.g., 
cows, chickens, horses, dogs, cats, etc.) are used. In 
15 another embodiment, Antigens displaying the antigenicity or 
immunogenicity of an antigen of a human pathogen are used. 

To determine immunogenicity or antigenicity by 
detecting binding to antibody, various immunoassays known in 
the art can be used, including but not limited cb ^competitive 
20 and non-competitive assay systems using techniques such as 
radioimmunoassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoradiometric assays, gel 
diffusion precipitin reactions, immunodiffusion assays, in 
situ immunoassays (using colloidal gold, enzyme or 
25 radioisotope labels, for example), western blots, 

immunoprecipitation reactions, agglutination assays (e.g., 
gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, 
protein A assays, and immunoelectrophoresis assays, etc. In 
30 one embodiment, antibody binding is detected by detecting a 
label on the primary antibody. In another embodiment, the 
primary antibody is detected by detecting binding of a 
secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labelled. Many 
35 means are known in the art for detecting binding in an 

immunoassay and are envisioned for use. In one embodiment 
for detecting immunogenicity, T cell-mediated responses can 
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be assayed by standard methods, e.g., in vitro cytoxicity 
assays or in vivo delayed-type hypersensitivity assays. 

Parasites and bacteria expressing epitopes 
(antigenic determinants) that can be "expressed by recombinant 
5 vesiculoviruses (wherein the foreign RNA directs the 
production of an antigen of the parasite ' or bacteria or a 
derivative thereof containing an epitope thereof) include but 
are not limited to those listed in Table II. 



10 



15 



20 



25 



TABLE II 

m», m PARASITES ™ D BACTERIA EXPRESSING EPITOPES 
THAT CAN BE EXPRESSED BY R KCOMR t N ANT 3£SICULOVIRUgES 

PARASITES: 

Plasmodium spp. 
Eimeria spp. 

BACTERIA : 

Vibrio cholera* 

Streptococcus pneumoniae 

Neisseria mennigitidis 

Neisseria gonorrhoeae 

Corynebacteria diphtheriae 

Clostridium tetani 

Bordetella pertussis 

Haemophilus spp. (e.g., influenzae) 

Chlamydia spp. 

Enterotoxigenic Escherichia coli 



In another embodiment, the Antigen comprises an 
epitope of an antigen of a nematode, to. protect against 
disorders caused by such worms. 

30 

In another specific embodiment, any DNA sequence 
which encodes a Plasmodium epitope, which when expressed by a 
recombinant vesiculovirus, is immunogenic in a vertebrate 
host, can be isolated for insertion into vesiculovirus (-) 
DNA according to the present invention. The species of 



35 



Plasmodium which can serve as DNA sources include but are not 
limited to the human malaria parasites P. falciparum, P. 
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malarlae, P. ovale, P. vivax, and the animal malaria 
parasites P. berghei, P. yoelli, P. knowlesi, and P. 
cynomolgi. In a particular embodiment, the epitope to be 
expressed is an epitope of the circumsporozoite (CS) protein 
5 of a species of Plasmodium (Miller et al. # 1986, Science 
234:1349). 

In yet another embodiment, the Antigen comprises. a 
peptide of the fi subunit of Cholera toxin (Jacob et al., 
1983, Proc. Natl. Acad. Sci. USA 80:7611). 

10 Viruses expressing epitopes (antigenic 

determinants) that can be expressed by recombinant 
vesiculoviruses (wherein the foreign RNA directs the 
production of an antigen of the virus or a derivative thereof 
comprising an epitope thereof) include but are not limited to 

15 those listed in Table III, which lists such viruses by family 
for purposes of convenience and not limitation (see 1990, 
Fields Virology, 2d ed., Fields and Knipe (eds.), Raven 
Press, NY). 



20 



TABLE III 



VIRUSES EXPRESSING EPITOPES THAT CAN 
BE EXPRESSED BY RE COMBINANT VESICULOVIRUSES 

I. Picornaviridae 
25 Enteroviruses 

Poliovirus 
Coxsackievirus 
Echovirus 
Rhinoviruses 
Hepatitis A Virus 

30 II. Caliciviridae 

Norwalk group of viruses 

III. Togaviridae and Flaviviridae 

Togaviruses (e.g., Dengue virus) 
Alphaviruses 

Flaviviruses (e.g., Hepatitis C virus) 
Rubella virus 

35 

IV. Coronaviridae 

Coronaviruses 
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V. Rhabdoviridae 

Rabies virus 

VI. Filoviridae 

Marburg viruses 
Ebola viruses 

5 

VII . Paramyxoviridae 

Parainfluenza virus 
Mumps virus 
Measles virus 

Respiratory syncytial virus 

10 VIII, Orthomyxoviridae 

Orthomyxoviruses (e.g., Influenza virus) 

IX. Bunyaviridae 

Bunyaviruses 

X. Arenaviridae 

Arenaviruses 

15 

XI. Reoviridae 

Reoviruses 

Rotaviruses 

Orbiviruses 

- HI. Retroviridae 
2 0 Human T Cell Leukemia Virus type I 

Human T Cell Leukemia Virus type II 
Human Immunodeficiency Viruses (e.g., 

type I and type II) 
Simian Immunodeficiency Virus 
Lentiviruses 

XIII. Papoviridae 

25 Polyomaviruses 

Papillomaviruses 
Adenoviruses 

XIV. Parvoviridae 

Parvoviruses 

30 XV. Herpesviridae 

Herpes Simplex Viruses 
Epstein-Barr virus 
Cytomegalovirus 
Varicella-Zoster virus 
Human Herpesvirus-6 

Cercopithecine Herpes Virus 1 (B virus) 

35 XVI. Poxviridae 

Poxviruses 
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XVIII. Hepadnaviridae 

Hepatitis B virus 



In specific embodiments, the Antigen encoded by the 
foreign sequences that is expressed upon infection of a host 
by the recombinant vesiculovirus, displays the antigenicity 
or immunogenicity of an influenza virus hemagglutinin 
(Genbank accession no. J02132; Air, 1981, Proc. Natl. Acad. 
10 Sci. USA 78:7639-7643; Newton et al., 1983, Virology 
128:495-501); human respiratory syncytial virus G 
glycoprotein (Genbank accession no. Z33429; Garcia et al., 
1994 , i J . Virol.; Collins et al., 1984, Proc. Natl. Acad. 
Sci. USA 81:7683); core protein, matrix protein or other 
protein of Dengue virus (Genbank accession no. M19197; Hahn 
et al., 1988, Virology 162:167-180), measles virus 
hemagglutinin (Genbank accession no. M81899; Rota et al., 
1992, Virology 188:135-14?): and herpes simplex virus type 2 
glycoprotein gB (Genbank WeJssion no. M14923; Bzik et al. , 
20 1986, Virology 155:322-333). 

In another embodiment, one or more epitopes of the 
fusion protein of respiratory synctyial virus (RSV) can be 
expressed as an Antigen. 

Other Antigens that can be expressed by a 
recombinant vesiculovirus include but are not limited to 
those displaying the antigenicity or immunogenicity of the 
following antigens: Poliovirus I VPl (Emini et al., 1983, 
Nature 304:699); envelope glycoproteins of HIV I (Putney et 
al., 1986, Science 234:1392-1395) ; Hepatitis B surface 
antigen (Itoh et al., 1986, Nature 308:19; Neurath et al . , 
1986, Vaccine 4:34); Diptheria toxin (Audibert et al., 1981, 
Nature 289:543); streptococcus 24M epitope (Beachey, 1985, 
Adv. Exp. Med. Biol. 185:193); and gonococcal pilin (Rothbard 
and Schoolnik, 1985, Adv. Exp. Med. Biol. 185:247). 

In other embodiments, the Antigen expressed by the 
recombinant vesiculovirus displays the antigenicity or 
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immunogenicity of pseudorabies virus g50 (gpD) , pseudorabies 
virus II (gpB) , pseudorabies virus gill (gpC) , pseudorabies 
virus glycoprotein H, pseudorabies virus glycoprotein E, 
transmissible gastroenteritis glycoprotein 195, transmissible 
5 gastroenteritis matrix protein, swine rotavirus glycoprotein 
38 f swine parvovirus capsid protein, Serpulina 
hydodysenteriae protective antigen, Bovine Viral Diarrhea 
glycoprotein 55, Newcastle Disease Virus hemagglutinin- 
neuraminidase, swine flu hemagglutinin, or swine flu 

10 neuraminidase. 

In various embodiments, the Antigen expressed by 
the recombinant vesiculovirus displays the antigenicity or 
immunogenicity of an antigen derived from Serpulina 
hyodysenteriae , Foot and Mouth Disease Virus, Hog Colera 

15 Virus, swine influenza virus, African Swine Fever Virus, 
Mycoplasma hyopneumoniae , infectious bovine rhinotracheitis 
virus (e.g., infectious bovine rhinotracheitis virus 
glycoprotein E or glycoprotein G) , or infectious 
r i^ryngotracheitis virus (e.g., infectious laryngotracheal* 1 

2 0 virus glycoprotein G or glycoprotein I) . 

In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of a glycoprotein of La Crosse 
Virus (Gonzales-Scarano et al., 1982, Virology 120:42), 
Neonatal Calf Diarrhea Virus (Matsuno and Inouye, 1983, 

25 Infection and Immunity 39:155), Venezuelan Equine 

Encephalomyelitis Virus (Mathews and Roehrig, 1982, J. 
Immunol. 129:2763), Punta Toro Virus (Dalrymple et al., 1981, 
in Replication of Negative Strand Viruses, Bishop and Compans 
(eds.), Elsevier, NY, p. 167), Murine Leukemia Virus (Steeves 

30 et al., 1974, J. Virol. 14:187), or Mouse Mammary Tumor Virus 
(Massey and Schochetman, 1981, Virology 115:20). - 

In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of an antigen of a human 
pathogen, including but not limited to human herpesvirus, 

35 herpes simplex virus-1, herpes simplex virus-2, human 

cytomegalovirus, Epstein-Barr virus, Varicella-Zoster virus, 
human herpesvirus-6 , human herpesvirus-7 , human influenza 
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virus, human immunodeficiency virus, rabies virus, measles 
virus, hepatitis B virus, hepatitis C virus, Plasmodium 
falciparum, and Bordetella pertussis . 

In a specific embodiment of the invention, a 
5 recombinant vesiculovirus expresses hepatitis B virus core 
protein and/or hepatitis B virus surface antigen or a 
fragment or derivative thereof (see, e.g., U.K. Patent 
Publication No. GB 2034323A published June 4, 1980; Ganem and 
Varmus, 1987, Ann. Rev. Biochem. 56:651-693; Tiollais et al., 

10 1985, Nature 317:489-495). The HBV genome (subtype adw) is 
contained in plasmid pAM6 (Moriarty et al., 1981, Proc. Natl. 
Acad. Sci. USA 78:2606-2610, available from the American Type 
Culture Collection (ATCC) , Accession No. 45020), a 
pBR322-based vector that is replicable in JET . coli. 

15 In another embodiment, the Antigen expressed by the 

recombinant vesiculovirus displays the antigenicity or 
immunogenicity of an antigen of equine influenza virus or 
equine herpesvirus. Examples of such antigens are equine 
influenza virus type A/AlaLJvci ^^-neuraminidase, equine 

20 influenza virus type A/Miami 63 neuraminidase, equine 
influenza virus type A/Kentucky 81 neuraminidase equine 
herpesvirus type 1 glycoprotein B, and equine herpesvirus 
type 1 glycoprotein D. 

In another embodiment, the Antigen displays the 

25 antigenicity or immunogenicity of an antigen of bovine 
respiratory syncytial virus or bovine parainfluenza virus. 
For example, such antigens include but are not limited to 
bovine respiratory syncytial virus attachment protein 
(BRSV G) , bovine respiratory syncytial virus fusion protein 

30 (BRSV F) , bovine respiratory syncytial virus nucleocapsid 
protein (BRSV N) , bovine parainfluenza virus type 3 fusion 
protein, and the bovine parainfluenza virus type 3 
hemagglutinin neuraminidase. 

In another embodiment, the Antigen displays the 

35 antigenicity or immunogenicity of bovine viral diarrhea virus 
glycoprotein 48 or glycoprotein 53. 
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In another embodiment, the Antigen displays the 
antigenicity or immunogenicity of an antigen of infectious 
bursal disease virus. Examples of such antigens are 
infectious bursal disease virus polyprotein and VP2. 
5 potentially useful antigens or derivatives thereof 

for use as Antigens expressed by recombinant vesiculoviruses 
can be identified by various criteria, such as the antigen's 
involvement in neutralization of a pathogen's infectivity 
(Norrby, 1985, Summary, in Vaccines85, Lerner et al. (eds.), 

10 Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
pp. 388-389) , type or group specificity, recognition by 
patients' antisera or immune cells, and/or the demonstration 
of protective effects of antisera or immune cells specific 
for the antigen. In addition, the antigen's encoded epitope 

15 should preferably display a small or no degree of antigenic 
variation in time or amongst different isolates of the same 
pathogen . 

In a preferred embodiment, the foreign DNA inserted 
*Tyfcd +■ .ie vesiculovirus (— ) DNA encodes an immunopotent 

20 dominant epitope of a pathogen. Foreign DNA encoding 

epitopes which are reactive with antibody although incapable 
of eliciting immune responses, still have potential uses in 
immunoassays (see Section 5.8, infra). 

In another embodiment, foreign RNA of the 

25 recombinant vesiculovirus directs the production of an 
Antigen comprising an epitope, which when the recombinant 
vesiculovirus is introduced into a desired host, induces an 
immune response that protects against a condition or disorder 
caused by an entity containing the epitope. For example, the 

30 Antigen can be a tumor specific antigen or tumor-associated 
antigen, for induction of a protective immune response 
against a tumor (e.g., a malignant tumor). Such tumor- 
specific or tumor-associated antigens include but are not 
limited to KS 1/4 pan-carcinoma antigen (Perez and Walker, 

35 1990, J. Immunol. 142:3662-3667; Bumal, 1988, Hybridoma 

7 (4): 407-415) ; ovarian carcinoma antigen (CA125) (Yu et al., 
1991, Cancer Res. 51 (2 ): 4 68-475) ; prostatic acid phosphate 
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(Tailor et al., 1990, Nucl. Acids Res. 18 ( 16) : 4928) ; prostate 
specific antigen (Henttu and Vihko, 198 9, Biochem. Biophys. 
Res. Comm. 160 (2) : 903-910 ; Israeli et al., 1993, Cancer Res. 
53:227-23 0; melanoma-associated antigen p97 (Estin et al., 
5 1989, J. Natl. Cancer Instit. 81 (6) : 445-446) ; melanoma 
antigen gp75 (Vi jayasardahl et al., 1990, J. Exp. Med. 
171 (4) : 1375-1380) ; high molecular weight melanoma antigen 
(Natali et al., 1987, Cancer 59:55-63); and prostate specific 
membrane antigen. 

10 In another embodiment of the invention, the Antigen 

expressed by the recombinant vesiculovirus comprises large 
regions of proteins which contain several B cell epitopes 
(i.e., epitopes capable of enticing a humoral immune 
response) and T cell epitopes (i.e., epitopes capable of 

15 inducing a cell-mediated immune response) . 

Peptides or proteins which are known to contain 
antigenic determinants can be used as the Antigen. If 
specific desired antigens are unknown, identification and 
characterization of iinmunoreactiv^.seqiiences can be carried 

20 out. One way in which to accomplish this is through the use 
of monoclonal antibodies generated to the surface or other 
molecules of a pathogen or tumor, as the case may be. The 
peptide sequences capable of being recognized by the 
antibodies are defined epitopes. Alternatively, small 

25 synthetic peptides conjugated to carrier molecules can be 
tested for generation of monoclonal antibodies that bind to 
the sites corresponding to the peptide, on the intact 
molecule (see, e.g., Wilson et al. , 1984, Cell 37:767). 

In a specific embodiment, appropriate Antigens, 

30 including fragments or derivatives of known antigens, can be 
identified by virtue of their hydrophilicity , by carrying out 
a hydrophilicity analysis (Hopp and Woods, 1981, Proc. Natl. 
Acad. Sci. USA 78:3824) to generate a hydrophilicity profile. 
A hydrophilicity profile can be used to identify the 

35 hydrophobic and hydrophilic regions of a protein and the 

corresponding regions of the gene sequence which encode such 
proteins. Hydrophilic regions are predicted to be 
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, immunogenic/ antigenic. Other methods known in the art which 
may be employed for the identification and characterization 
of antigenic determinants are also within the scope of the 
invention. 

5 The foreign DNA encoding the Antigen, that is 

inserted into a non-essential site of the vesiculovirus (-) 
DNA, optionally can further comprise a foreign DNA sequence 
encoding a cytokine capable of being expressed and 
stimulating an immune response in a host infected by the 

10 recombinant vesiculovirus. For example, such cytokines 

include but are not limited to interleukin-2 , inter leukin-6, 
inter leukin-12 , interferons, granulocyte-macrophage colony 
stimulating factors, and inter leukin receptors. 

The foreign DNA optionally can further comprise a 

15 sequence encoding and capable of expressing a detectable 
marker (e.g., f3 galactosidase) . 

5.3, CONSTRUCTION OF VESICULOVIRUS (-) DNA 
CONTAINING FOREIGN DNA 

20 For initial production of a recombinant 

vesiculovirus, the foreign DNA comprising a sequence encoding 
the desired antigen is inserted into and/or replaces a region 
of the vesiculovirus (-) DNA nonessential for replication. 
Many strategies known in the art can be used in the 

25 construction of the vesiculovirus (-) DNA containing the 
foreign DNA. For example, the relevant sequences of the 
foreign DNA and of the vesiculovirus (-) DNA can, by 
techniques known in the art, be cleaved at appropriate sites 
with restriction endonuclease(s) , isolated, and ligated in 

30 vitro* If cohesive termini are generated by restriction 

endonuclease digestion, no further modification of DNA before 
ligation may be needed. If, however, cohesive termini of the 
DNA are not available for generation by restriction 
endonuclease digestion, or different sites other than those 

35 available are preferred, any of numerous techniques known in 
the art may be used to accomplish ligation of the 
heterologous DNA at the desired sites. In a preferred 
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embodiment, a desired restriction enzyme site is readily 
introduced into the desired DNA by amplification of the DNA 
by use of PCR with primers containing the restriction enzyme 
site. By way of another example, cleavage with a restriction 
5 enzyme can be followed by modification to create blunt ends 
by digesting back or filling in single-stranded DNA termini 
before ligation. Alternatively, the cleaved ends of the 
vesiculovirus (-) DNA or foreign DNA can be "chewed back 1 ' 
using a nuclease such as nuclease Bal 31, exonuclease III, 

10 lambda exonuclease, iming bean nuclease, or T4 DNA polymerase 
exonuclease activity, to name but a few, in order to remove 
portions of the sequence. 

To facilitate insertion of the foreign DNA, an 
oligonucleotide sequence (a linker) which encodes one or more 

15 restriction sites can be inserted in a region of the 

vesiculovirus (-) DNA (see, e.g., the polylinker in pVSVSSl, 
Fig. 2) by ligation to DNA termini. A linker may also be 
used to generate suitable restriction sites in the foreign 
DNA sequence . ." . ■ /y^;/- 

20 Additionally, vesiculovirus (-) DNA or foreign DNA 

sequences can be mutated In vitro or In vivo in order to form 
new restriction endonuclease sites or destroy preexisting 
ones, to facilitate In vitro ligation procedures. Any 
technique for mutagenesis known in the art can be used, 

25 including but not limited to, In vitro site-directed 
mutagenesis (Hutchinson et al. , 1978, J. Biol. Chem. 
253:6551), chemical mutagenesis, etc. 

Sequences of the vesiculovirus (-) DNA that have 
been undesirably modified by such In vitro manipulations can 

30 be "restored," if desired, by introduction of appropriate 
sequences at the desired sites. 

The particular strategy for inserting the foreign 
DNA will depend on the specific vesiculovirus (-) DNA site to 
be replaced or inserted into, as well as the foreign DNA to 

35 be inserted. 
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The sequences encoding the immunogenic peptides or 
proteins are preferably present in single copies, but can 
also be present in multiple copies within the virus genome. 

Formation of the desired vesiculovirus (-) DNA 
5 containing the foreign DNA can be confirmed by standard 
methods such as DNA sequence analysis, hybridization 
analysis, and/or restriction mapping, using methods well 
known in the art. 

Foreign DNA encoding a desired antigen can be 
10 obtained from any of numerous sources such as cloned DNA, 

genomic DNA, or cDNA made from RNA of the desired pathogen or 
tumor, as the case may be, or chemically synthesized DNA, and 
manipulated by recombinant DNA methodology well known in the 
art (see Sambrook et al. , 1991, Molecular Cloning, A 
15 Laboratory Manual, 2d ed. , Cold Spring Harbor Laboratory 
Press, New York). In a preferred embodiment, polymerase 
chain reaction (PCR) is used to amplify the desired fragment 
of foreign DNA from among a crude preparation of DNA or a 
small s?mp4*e rf the DNA, by standard methods. Appropriate 
20 primers for use in PCR can be readily deduced based on 
published sequences. 

In order to generate appropriate DNA fragments, the 
DNA (e.g., from the pathogen or tumor of interest) may be 
cleaved at specific sites using various restriction enzymes. 
25 Alternatively, one may use DNasel in the presence of 

manganese, or mung bean nuclease (McCutchan et al., 1984, 
Science 225:626), to fragment the DNA, or the DNA can be 
physically sheared, as for example, by sonication. The 
linear DNA fragments can then be separated according to size 
30 by standard techniques, including, but not limited to, 
agarose and polyacrylamide gel electrophoresis and column 
chromatography. 

PCR amplification of DNA fragments containing the 
desired epitope (s) is most preferably carried out, in which 
35 the PCR primers contain and thus introduce into the amplified 
DNA a desired restriction enzyme recognition site. 
Alternatively, any restriction enzyme or combination of 
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restriction enzymes may be used to generate DNA fragment(s) 
containing the desired epitope (s) , provided the enzymes do 
not destroy the immunopotency of the encoded product. 
Consequently, many restriction enzyme combinations may be 
5 used to generate DNA fragments which, when inserted into the 
vesiculovirus (-) DNA, are capable of producing recombinant 
vesiculoviruses that direct the production of the peptide 
containing the epitope (s ) . 

Once the DNA fragments are generated, 
10 identification of the specific fragment containing the 
desired sequence may be accomplished in a number of ways. 
For example, if a small amount of the desired DNA sequence or 
a homologous sequence is previously available, it can be used 
as a labeled probe (e.g., nick translated) to detect the DNA 
15 fragment containing the desired sequence, by nucleic acid 

hybridization. Alternatively, if the sequence of the derived 
gene or gene fragment is known, isolated fragments or 
portions thereof can be sequenced by methods known in the 
-■■art-, airti identified by a comparison .~f the derived sequence 
20 to that of the known DNA or protein sequence. Alternatively, 
the desired fragment can be identified by techniques 
including but not limited to mRNA selection, making cDNA to 
the identified mRNA, chemically synthesizing the gene 
sequence (provided the sequence is known) , or selection on 
25 the basis of expression of the encoded protein (e.g., by 
antibody binding) after "shotgun cloning" of various DNA 
fragments into an expression system. 

The sequences encoding peptides to be expressed in 
recombinant vesiculoviruses according to the present 
30 invention, whether produced by recombinant DNA methods, 

chemical synthesis, or purification techniques, include but 
are not limited to sequences encoding all or part (fragments) 
of the amino acid sequences of pathogen-specific and 
tumor-specific antigens, as well as other derivatives and 
35 analogs thereof displaying the antigenicity or immunogenicity 
thereof. Derivatives or analogs of antigens can be tested 
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for the desired activity by procedures known in the art, 
including but not limited to standard immunoassays. 

In particular, antigen derivatives can be made by 
altering the encoding antigen nucleotide sequences by 
5 substitutions, additions or deletions that do not destroy the 
antigenicity or immunogenicity of the antigen. For example, 
due to the degeneracy of nucleotide coding sequences, other 
DNA sequences which encode substantially the same amino acid 
sequence as a native antigen gene or portion thereof may be 
10 used in the practice of the present invention. Other 
examples may include but are not limited to nucleotide 
sequences comprising all or portions of genes or cDNAs which 
are altered by the substitution of different codons that 
encode a functionally equivalent amino acid residue within 
15 the sequence, thus producing a silent change. For example, 
one or more amino acid residues within the sequence can be 
substituted by another amino acid of a similar polarity which 
acts as a functional equivalent, resulting in a silent 
alteration. Substitutes for an amino acid within the 
20 sequence may be selected from other members of the class to 
which the amino acid belongs. For example, the nonpolar 
(hydrophobic) amino acids include alanine, leucine, 
isoleucine, valine, proline, phenylalanine, tryptophan, and 
methionine. The polar neutral amino acids include glycine, 
25 serine, threonine, cysteine, tyrosine, asparagine, and 
glutamine. The positively charged (basic) amino acids 
include arginine, lysine, and histidine. The negatively 
charged (acidic) amino acids include aspartic and glutamic 
acid. 

30 The antigen derivatives and analogs can be produced 

by various methods known in the art. For example, a cloned 
gene sequence can be modified by any of numerous strategies 
known in the art (Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, 

35 Cold Spring Harbor, New York). The sequence can be cleaved 
at appropriate sites with restriction endonuclease (s) , 
followed by further enzymatic modification if desired, 
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isolated, and ligated in vitro. In the production of the 
gene encoding a derivative or analog of an antigen, care 
should be taken to ensure that the modified gene remains 
within the same translational reading frame as the antigen, 
5 uninterrupted by translational stop signals, in the gene 
region where the desired epitope (s) are encoded. 

Additionally, the antigen-encoding nucleic acid 
sequence can be mutated in vitro or in vivo , to create and /or 
destroy translation, initiation, and/or termination 
10 sequences, or to create variations in coding regions and/or 
form new restriction endonuclease sites or destroy 
preexisting ones, to facilitate further in vitro 
modification. Any technique for mutagenesis known in the art 
can be used, including but not limited to, in vitro site- 
15 directed mutagenesis (Hutchinson, C. , et al., 1978, J. Biol. 
Chem 253:6551), use of TAB® linkers (Pharmacia), etc. 

In another specific embodiment, the encoded antigen 
derivative is a chimeric, or fusion, protein comprising a 
firsi. prcteiii or fragment thereof fused >o ^. senchd, 
20 different amino acid sequence. Such a chimeric protein is 
encoded by a chimeric nucleic acid in which the two coding 
sequences are joined inframe. Such a chimeric product can be 
made by ligating the appropriate nucleic acid sequences 
encoding the desired amino acid sequences to each other by 
25 methods known in the art, in the proper coding frame. In a 
specific embodiment, a fusion protein is produced in which 
the first protein sequence contains an epitope of an antigen, 
and the second protein sequence contains an epitope of a 
different antigen. 

30 Derivatives and fragments of known antigens can be 

readily tested by standard immunoassay techniques to 
ascertain if they display the desired immunogenicity or 
antigenicity, rendering a DNA sequence encoding such a 
fragment or derivative suitable for insertion into the 

35 vesiculovirus (-) DNA. 

A DNA sequence encoding an epitope that is a 
hapten, i.e., a molecule that is antigenic in that it can 
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react selectively with cognate antibodies, but not 
immunogenic in that it cannot elicit an immune response when 
administered without adjuvants or carrier proteins, can also 
be isolated for use, since it is envisioned that, in 
5 particular embodiments, presentation by the vesiculoviruses 
of the invention can confer immunogenicity to the hapten 
expressed by the virus. 

Once identified and isolated, the foreign DNA 
containing the seguence(s) of interest is then inserted into 
10 the vesiculovirus (-) DNA, for production of a recombinant 
vesiculovirus, 

5 - 4 - PRODUCTI ON OF RECOMBINANT VESICULOVIRUSES 

The recombinant vesiculoviruses of the invention 
15 are produced by providing in an appropriate host cell: 
vesiculovirus (-) DNA, in which regions nonessential for 
replication have been inserted into or replaced by foreign 
DNA comprising a sequence encoding an Antigen,, and 
recombinant -'s-prcos of vesiculovirus N protein, P protein, 
20 and L protein. The production is preferably In vitro, in 
cell culture. 

The host cell used for recombinant vesiculovirus 
production can be any cell in which vesiculoviruses grow, 
e.g., mammalian cells and some insect (e.g., Drosophila) 

25 cells. Primary cells, or more preferably, cell lines can be 
used. A vast number of cell lines commonly known in the art 
are available for use. By way of example, such cell lines 
include but are not limited to BHK (baby hamster kidney) 
cells, CHO (Chinese hamster ovary) cells, HeLA (human) cells, 

30 mouse L cells, Vero (monkey) cells, ESK-4, PK-15, EMSK cells, 
MDCK (Madin-Darby canine kidney) cells, MDBK (Madin-Darby 
bovine kidney) cells, 293 (human) cells, and Hep-2 cells. 

The sources of N, P, and L proteins can be the same 
or different recombinant nucleic acid(s), encoding and 

35 capable of expressing the N, P and L proteins in the host 
cell in which it is desired to produce recombinant 
vesiculovirus. 
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The nucleic acids encoding the N, P and L proteins 
are obtained by any means available in the art. The N, P and 
L nucleic acid sequences have been disclosed and can be used. 
For example, see Genbank accession no. J02428; Rose and 
5 Schubert, 1987, in The Viruses: The Rhabdovlruses , Plenum 
Press, NY, pp. 129-166. The sequences encoding the N, P and 
L genes can also be obtained from plasmid pVSVFL(+) , 
deposited with the ATCC and assigned accession no. 97134, 
e.g., by PCR amplification of the desired gene (PCR; U.S. 
10 Patent Nos. 4,683,202, 4,683,195 and 4,889,818; Gyllenstein 
et al., 1988, Proc. Natl. Acad. Sci. USA 85:7652-7656; Ochman 
et al., 1988, Genetics 120:621-623; Loh et al., 1989, Science 
243:217-220). If a nucleic acid clone of any of the N, P or 
L genes is not already available, the clone can be obtained 
15 by use of standard recombinant DNA methodology. For example, 
the DNA may be obtained by standard procedures known in the 
art by purification of RNA from vesiculoviruses followed by 
reverse transcription and polymerase chain reaction (Mullis 
and Falooiid , 1987 , Methods in Enzymology 15^: 335^3Bo; . 
20 Alternatives to isolating an N, P or L gene include, but are 
not limited to, chemically synthesizing the gene sequence 
itself. Other methods are possible and within the scope of 
the invention. 

If desired, the identified and isolated gene can 
25 then optimally be inserted into an appropriate cloning vector 
prior to transfer to an expression vector. 

Nucleic acids that encode derivatives (including 
fragments) and analogs of native N, P and L genes, as well as 
derivatives and analogs of the vesiculovirus (-) DNA can also 
30 be used in the present invention, as long as such derivatives 
and analogs retain function, as exemplified by the ability 
when used according to the invention to produce a replicable 
vesiculovirus containing a genomic RNA containing foreign 
RNA. In particular, derivatives can be made by altering 
35 sequences by substitutions, additions, or deletions that 

provide for functionally active molecules. Furthermore, due 
to the inherent degeneracy of nucleotide coding sequences, 
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other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be used in 
the practice of the methods of the invention. Amino acid 
substitutions may be made on the basis of similarity in 
5 polarity, charge, solubility, s hydrophobicity, hydrophilicity 
and/or the amphipathic nature of the residues involved. 

The desired N/P/L-encoding nucleic acid is then 
preferably inserted into an appropriate expression vector, 
i.e., a vector which contains the necessary elements for the 
10 transcription and translation of the inserted protein-coding 
sequence in the host in which it is desired to produce 
recombinant vesiculovirus, to create a vector that functions 
to direct the synthesis of the N/P/L protein that will 
subsequently assemble with the vesiculovirus genomic RNA 
15 containing the foreign sequence (produced in the host cell 
from antigenomic vesiculovirus (+) RNA produced by 
transcription of the vesiculovirus (-) DNA) . A variety of 
vector systems may be utilized to express the N, P and 
L-coding seguenc -sc. as y* n a s to transcribe the 
20 vesiculovirus (-) DNA containing the foreign DNA, as long as 
the vector is functional in the host and compatible with any 
other vector present. Such vectors include but are not 
limited to bacteriophages; plasmids, or cosmids. In a 
preferred aspect, a plasmid expression vector is used. The 
25 expression elements of vectors vary in their strengths and 
specificities. Any one of a number of suitable transcription 
and translation elements may be used, as long as they are 
functional in the host. 

Standard recombinant DNA methods may be used to 
30 construct expression vectors containing DNA encoding the N, 
P, and L proteins, and the vesiculovirus (-) DNA containing 
the foreign DNA, comprising appropriate transcriptional/ 
translational control signals (see, e.g., Sambrook et al. , 
1989, supra, and methods described hereinabove). 
35 (Translational control signals are not needed for 

transcription of the vesiculovirus (-) DNA, and thus may be 
omitted from a vector containing the vesiculovirus (-) DNA, 
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although such signals may be present in the vector and 
operably linked to other sequences encoding a protein which 
it is desired to express). Expression may be controlled by 
any promoter/ enhancer element known in the art. Promoters 
5 which may be used to control expression can be constitutive 
or inducible. In a specific embodiment, the promoter is an 
RNA polymerase promoter. 

Transcription termination signals (downstream of 
the gene) , and selectable markers are preferably also 
10 included in a plasmid expression vector. In addition to 
promoter sequences, expression vectors for the N, P, and L 
proteins preferably contain specific initiation signals for 
efficient translation of inserted N/P/L sequences, e.g., a 
ribosome binding site. 

15 Specific initiation signals are required for 

efficient translation of inserted protein coding sequences. 
These signals include the ATG initiation codon and adjacent 
sequences. In cases where the entire N, P, or L gene 
including its own initiation codon and adjacent sequences are 

20 inserted into the appropriate vectors, no additional 

translational control signals may be needed. However, in 
cases where only a portion of the gene sequence is inserted, 
exogenous translational control signals, including the ATG 
initiation codon, must be provided. The initiation codon 

25 must furthermore be in phase with the reading frame of the 
protein coding sequences to ensure translation of the entire 
insert. These exogenous translational control signals and 
initiation codons can be of a variety of origins, both 
natural and synthetic. 

30 In a specific embodiment, a recombinant expression 

vector provided by the invention, encoding an N, P, and /or L 
protein or functional derivative thereof, comprises the 
following operatively linked components: a promoter which 
controls the expression of the N, P, or L protein or 

35 functional derivative thereof, a translation initiation 
signal, a DNA sequence encoding the N, P or L protein or 
functional derivative thereof, and a transcription 
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termination signal. In a preferred aspect, the above 
components are present in 5' to 3' order as listed above. 

In another specific embodiment, the gene encoding 
the N, P, or L protein is inserted downstream of the T7 RNA 
5 polymerase promoter from phage T7 gene 10, situated with an A 
in the -3 position. A T7 RNA polymerase terminator and a 
replicon are also included in the expression vector. In this 
embodiment, T7 RNA polymerase is provided to transcribe the 
N/P/L sequence. The T7 RNA polymerase can be produced from a 

10 chromosomally integrated sequence or episomally, and is most 
preferably provided by intracellular expression from a 
recombinant vaccinia virus encoding the T7 RNA polymerase 
(see infra). Preferably, the N, P, and L proteins are each 
encoded by a DNA sequence operably linked to a promoter in an 

15 expression plasmid, containing the necessary regulatory 

signals for transcription and translation of the N, P, and L 
proteins. Such an expression plasmid preferably includes a 
promoter, the coding sequence, and a transcription 
terminatio>/polyj:Jic:)yj.ation signal, and option?/ ly.. a 

20 selectable marker (e.g., 0-galactosidase) . The N, P and L 
proteins can be encoded by the same or different plasmids, or 
a combination thereof, and preferably are in different 
plasmids. Less preferably, one or more of the N, P, and L 
proteins can be expressed intrachromosomally . 

25 The cloned sequences comprising the vesiculovirus 

(-) DNA containing the foreign DNA, and the cloned sequences 
comprising sequences encoding the N, P, and L proteins can be 
introduced into the desired host cell by any method known in 
the art, e.g., transfection, electroporation, infection (when 

30 the sequences are contained in, e.g., a viral vector), 
microin j ection , etc . 

In a. preferred embodiment, DNA comprising 
vesiculovirus (-) DNA containing foreign DNA encoding an 
Antigen, operably linked to an RNA polymerase promoter 

35 (preferably a bacteriophage RNA polymerase promoter) ; DNA 
encoding N, operably linked to the same RNA polymerase 
promoter; DNA encoding P, operably linked to the same 
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polymerase promoter; and DNA encoding L, operably linked to 
the same polymerase promoter; are all introduced (preferably 
by transfection) into the same host cell, in which host cell 
the RNA polymerase has been cytoplasmically provided. The 
5 RNA polymerase is cytoplasmically provided preferably by 
expression from a recombinant virus that replicates in the 
cytoplasm and expresses the RNA polymerase, most preferably a 
vaccinia virus (see the section hereinbelow) , that has been 
introduced (e.g., by infection) into the same host cell. 

10 Cytoplasmic provision of RNA polymerase is preferred, since 
this will result in cytoplasmic transcription and processing, 
of the VSV (-) DNA comprising the foreign DNA and of the N, P 
and L proteins, avoiding splicing machinery in the cell 
nucleus, and thus maximizing proper processing and production 

15 of N, P and L proteins, and resulting assembly of the 

recombinant vesiculovirus. For example, vaccinia virus also 
cytoplasmically provides enzymes for processing (capping and 
polyadenylation) of rnRNA, facilitating proper translation. 
In a most preferred aspect, T7 RNA polymerase promote! 3 are 

20 employed, and a cytoplasmic source of T7 RNA polymerase is 
provided by also introducing into the host cell a recombinant 
vaccinia virus encoding T7 RNA polymerase into the host cell. 
Such vaccinia viruses can be obtained by well known methods 
(see section 5.5, infra) . In a preferred aspect, a 

25 recombinant vaccinia virus such as VTF7-3 (Fuerst et al., 
1986, Proc. Natl. Acad. Sci. U.S.A. 83:8122-8126) can be 
used. In a most preferred aspect, the DNA comprising 
vesiculovirus (-) DNA containing foreign DNA is plasmid 
pVSVSSl in which foreign DNA has been inserted into the 

30 polylinker region. 

Alternatively, but less preferably, the RNA 
polymerase (e.g., T7 RNA polymerase) can be provided by use 
of a host cell that expresses T7 RNA polymerase from a 
chromosomally integrated sequence (e.g., originally inserted 

35 into the chromosome by homologous recombination) , preferably 
constitutively , or that expresses T7 RNA polymerase 
episomally, from a plasmid. 
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In another, less preferred, embodiment, the VSV (-) 
DNA encoding an Antigen, operably linked to a promoter, can 
be transfected into a host cell that stably recombinantly 
expresses the N, P, and L proteins from chromosomally 
5 integrated sequences. 

The cells are cultured and recombinant 
vesiculovirus is recovered, by standard methods. For 
example, and not by limitation, after approximately 24 hours, 
cells and medium are collected, f reeze-thawed, and the 
10 lysates clarified to yield virus preparations. 

Alternatively, the cells and medium are collected and simply 
cleared of cells and debris by low-speed centrifugation. 

Confirmation that the appropriate foreign sequence 
is present in the genome of the recombinant vesiculovirus and 
15 directs the production of the desired protein (s) in an 
infected cell, is then preferably carried out. Standard 
procedures known in the art can be used for this purpose. 
For example, genomic RNA is obtained from the vesiculovirus 
by SDS phenol ^xtL-actiwiV f rum virus preparations, : nd can b< 
20 subjected to reverse transcription (and PCR, if desired) , 
followed by sequencing, Southern hybridization using a probe 
specific to the foreign DNA, or restriction enzyme mapping, 
etc. The virus can be used to infect host cells, which can 
then be assayed for expression of the desired protein by 
25 standard immunoassay techniques using an antibody to the 
protein, or by assays based on functional activity of the 
protein. Other techniques are known in the art and can be 
used. 

The invention also provides kits for production of 
30 recombinant vesiculoviruses. In one embodiment, the kit 
comprises in one or more (and most preferably, in separate) 
containers: (a) a first recombinant DNA that can be 
transcribed in a suitable host cell to produce a 
vesiculovirus antigenomic ( + ) RNA in which a portion of the 
35 RNA nonessential for replication of the vesiculovirus has 

been inserted into or replaced by a foreign RNA sequence; (b) 
a second recombinant DNA comprising a sequence encoding a 
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vesiculovirus N protein; (c) a third recombinant DNA 
comprising a sequence encoding a vesiculovirus L protein; and 
(d) a fourth recombinant DNA comprising a sequence encoding a 
vesiculovirus P protein. The second, third and fourth 
5 recombinant DNAs can be part of the same or different DNA 
molecules. In a preferred embodiment, the sequences encoding 
the N, L, and P proteins are each operably linked to a 
promoter that controls expression of the N, L, and P 
proteins, respectively, in the suitable host cell. In 
10 various embodiments, the kit can contain the various nucleic 
acids, e.g., plasmid expression vectors, described 
hereinabove for use in production of recombinant 
vesiculoviruses. 

In another embodiment, a kit of the invention 
15 comprises (a) a first recombinant DNA that can be transcribed 
in a suitable host cell to produce a vesiculovirus 
antigenomic DNA in which a portion of the RNA nonessential 
for replication of the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence; and (b; a host call 
2 0 that recombinant ly expresses vesiculovirus N, P and L 
proteins. 

In a preferred embodiment, a kit of the invention 
comprises in separate containers: 

(a) a first plasmid comprising the following 
25 operatively linked components: (i) a bacteriophage RNA 

polymerase promoter, (ii) a DNA comprising a sequence capable 
of being transcribed in a suitable host cell to produce an 
RNA molecule comprising a vesiculovirus antigenomic RNA in 
which a portion of the RNA nonessential for replication of 

30 the vesiculovirus has been inserted into or replaced by a 
foreign RNA sequence, and in which the 3' end of the 
antigenomic RNA is immediately adjacent to a ribozyme 
sequence that cleaves at the 3' end of the antigenomic RNA, 
and (iii) a transcriptional termination signal for the 

35 bacteriophage RNA polymerase; and 

(b) a second plasmid comprising the following 
operatively linked components: (i) the bacteriophage RNA 
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polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus N protein, and (ii) a 
transcriptional termination signal for the bacteriophage RNA 
polymerase; and 

n5 (c) a third plasmid comprising the following 

operatively linked components: (i) the bacteriophage RNA 
polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus P protein, and (ii) a 
transcriptional termination signal for the bacteriophage RNA 

10 polymerase; and 

(d) a fourth plasmid comprising the following 
operatively linked components: (i) the bacteriophage RNA 
polymerase promoter, (ii) a DNA comprising a sequence 
encoding the vesiculovirus L protein, and (ii) a 
15 transcriptional termination signal for the bacteriophage RNA 
polymerase. 

. . In another embodiment, a kit of the invention 
further comprises in a separate container a recombinant 
vaccinia virus encoding and ccrpable of expressing the 
2 0 bacteriophage RNA polymerase. 

In a preferred embodiment, the components in the 
containers are in purified form. 

5.4.1. RECOMBINANT VACCINIA VIRUSES 
25 ENCODING AND CAPABLE OF 

EXPRESSING FOREIGN RNA POLYMERASES 

In a preferred aspect of the invention, 
transcription of the vesiculovirus (-) DNA containing the 
foreign DNA encoding an Antigen, and/or transcription of the 
DNA encoding the N, P, and L proteins in the host cell, is 

30 

controlled by an RNA polymerase promoter (preferably one in 
which the RNA polymerase is not endogenous to the host cell) , 
and the RNA polymerase (that initiates transcription from the 
promoter) is recombinantly provided in the host cell by 
expression from a recombinant vaccinia virus. DNA sequences 

35 

encoding RNA polymerases are well known and available in the 
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art and can be used. For example, phage DNA can be obtained 
and PCR used to amplify the desired polymerase gene. 

Insertion of the desired recombinant DNA sequence 
encoding and capable of expressing the RNA polymerase into a 
5 vaccinia virus for expression by the vaccinia virus is 

preferably accomplished by first inserting the DNA sequence 
into a plasmid vector which is capable of subsequent transfer 
to a vaccinia virus genome by homologous recombination. 
Thus, in a preferred aspect of the invention for constructing 

10 the recombinant vaccinia viruses, the desired DNA sequence 
encoding the polymerase is inserted, using recombinant DNA 
methodology (see Sambrook et al., 1989, Molecular Cloning, A 
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor, New York) into an insertion (preferably, 

is plasmid) vector flanked by (preferably) nonessential vaccinia 
DNA sequences, thus providing for subsequent transfer of its 
chimeric gene(s) into vaccinia virus by homologous 
recombination. The sequences are placed in the vector such 
that they can be expressed under the control of a - prcLuuer 

20 functional in vaccinia virus. 

Expression of foreign DNA in recombinant vaccinia 
viruses requires the positioning of promoters functional in 
vaccinia so as to direct the expression of the protein-coding 
polymerase DNA sequences. Plasmid insertion vectors have 

25 been constructed to insert chimeric genes into vaccinia virus 
for expression therein. Examples of such vectors are 
described by Mackett (Mackett et al., 1984. J. Virol. 49:857- 
864). The DNA encoding the polymerase is inserted into a 
suitable restriction endpnuclease cloning site. In addition 

30 to plasmid insertion vectors, insertion vectors based on 
single-stranded M13 bacteriophage DNA (Wilson et al., 1986, 
Gene 49:2 07-213) can be used. 

The inserted polymerase DNA should preferably not 
contain introns, and insertion should preferably be so as to 

35 place the coding sequences in close proximity to the 

promoter, with no other start codons in between the initiator 
ATG and the 5' end of the transcript, 
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The plasmid insertion vector should contain 
transcriptional and translational regulatory elements that 
are active in vaccinia virus. The plasmid should be 
configured so that the polymerase sequences are under the 
5 control of a promoter , active in vaccinia virus. Promoters 
which can be used in the insertion vectors include but are 
not limited to the vaccinia virus thymidine kinase (TK) 
promoter, the 7.5K promoter (Cochran et al., 1985, J. Virol. 
54:30-37), the 11K promoter (European Patent Publication 
10 0198328), the F promoter (Paoletti et al., 1984, Proc. Natl. 
Acad. Sci. USA 81:193-197),. and various early and late 
vaccinia promoters (see Moss, 1990, Virology, 2d ed., ch. 74, 
Fields et al., eds. , Raven Press, Ltd., New York, pp. 2079- 
2111) . 

15 In a specific embodiment, the plasmid insertion 

vector contains (for eventual transfer into vaccinia virus) a 
T7 RNA polymerase coding sequence under the control of a 
promoter active in vaccinia virus. In another specific 
embodiment, a plasmid insertion sector contains a co- 

20 expression system consisting of divergently oriented 

promoters, one directing transcription of the polymerase 
sequences, the other directing transcription of a reporter 
gene or selectable marker, to facilitate detection or 
selection of the eventual recombinant vaccinia virus (see, 

25 e.g., Fuerst et al., 1987, Mol. Cell. Biol. 5:1918-1924). 

As described supra, the plasmid insertion vector 
contains at least one set of polymerase coding sequences 
operatively linked to a promoter, flanked by sequences 
preferably nonessential for vaccinia viral replication. Such 

30 nonessential sequences include but are not limited to the TK 
gene (Mackett et al., 1984, J. Virol. 49:857-864), the 
vaccinia Hindlll-F DNA fragment (Paoletti et al. , 1984, Proc. 
Natl. Acad. Sci. USA 81:193-197), the vaccinia growth factor 
gene situated within both terminal repeats (Buller et al., 

35 1988, J. Virol. 62:866-874), the N2 and Ml genes (Tamin et 
al., 1988, Virology 165:141-150), the Ml subunit of the 
ribonucleotide reductase gene in the vaccinia HindIII-I DNA 
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fragment (Child et al., 1990, virology 174:625-629), the 
vaccinia hemagglutinin (Shida et al., 1988, J. Virol. 
62:4474-4480), vaccinia 14 kD fusion protein gene (Rodriguez 
et al., 1989, Proc. Natl. Acad. Sci. USA 86:1287-1291), etc. 
5 (see also Buller and Palumbo, 1991, Microbiol. Rev. 55(1): 80- 
122) . TK sequences are preferred for use; use of such 
sequences results in the generation of TK" recombinant 
viruses . 

Recombinant vaccinia viruses are preferably 

10 produced by transfection of the recombinant insertion vectors 
containing the polymerase sequences into cells previously 
infected with vaccinia virus. Alternatively, transfection 
can take place prior to infection with vaccinia virus. 
Homologous recombination takes place within the infected 

15 cells and results in the insertion of the foreign gene into 
the viral genome, in the region corresponding to the 
insertion vector flanking regions. The infected cells can be 
screened using a variety of procedures such as immunological 
! \- techniques, DNA plaque hybridization, or genetic selection 

2 0 for recombinant viruses which subsequently can be isolated. 
These vaccinia recombinants preferably retain their essential 
functions and infectivity and can be constructed to 
accommodate up to approximately 3 5 kilobases of foreign DNA. 

Transf ections may be performed by procedures known 

25 in the art, for example, a calcium chloride-mediated 
procedure (Mackett et al., 1985, The construction and 
characterization of vaccinia virus recombinants expressing 
foreign genes, in DNA Cloning, Vol. II, Rickwood and Hames 
(eds.), IRL Press, Oxford-Washington, D.C.) or a 

30 ; liposome-mediated procedure (Rose et al., 1991, Biotechniques 
10:520-525). 

Where, as is preferred, flanking TK sequences are 
used to promote homologous recombination, the resulting 
recombinant viruses thus have a disrupted TK region, 
35 permitting them to grow on a TK~ host cell line such as Rat2 
(ATCC Accession No. CRL 1764) in the presence of 

- 47 - 



BNSDOCID: <WO 9634625A1_IA> 



WO 96/34625 



PCT/US96/06053 



5-bromo-2 • -deoxyuridine (BUDR) , under which conditions 
non-recombinant (TK + ) viruses will not grow. 

In another embodiment, recombinant vaccinia viruses 
of the invention can be made by in vitro cloning, and then 
5 packaging with a poxvirus sensitive to a selection condition, 
rather than by homologous recombination (see International 
Publication No. WO 94/12617 dated June 9, 1994). For 
example, the HBV DNA sequences can be inserted into vaccinia 
genomic DNA using standard recombinant DNA techniques in 
10 vitro; this recombinant DNA can then be packaged in the 
presence of a "helper" poxvirus such as a temperature 
sensitive vaccinia virus mutant or a fowlpox virus which can 
be selected against under the appropriate conditions. 

Various vaccinia virus strains known in the art can 
15 be used to generate the recombinant viruses of the invention. 
A preferred vaccinia virus is the New York City Department of 
Health Laboratories strain, prepared by Wyeth (available from 
the American Type Culture Collection (ATCC) , Accession No. 
VR-325) . other vaccinia & tiding -include but are not limited 
20 to the Elstree and Moscow strains, the strain of Rivers (CV-l 
and CV-2), and the LC16m8 strain of Hashizume. 

Selection of the recombinant vaccinia virus can be 
by any method known in the art, including hybridization 
techniques (e.g. , using polymerase DNA sequences as a 
25 hybridization probe), immunological techniques (e.g., assay 
for binding to antibodies recognizing the encoded polymerase 
epitope (s)), etc. In a preferred aspect where TK flanking 
sequences are used in the insertion vector, selection is for 
TK"" recombinants, as described above; screening for the 
30 correct recombinant can then be carried out by standard 

molecular analyses. In many preferred aspects, the method of 
choice for selection is dictated by the selectable marker in 
an insertion vector used to generate the recombinant viruses. 

The selected recombinant vaccinia virus is then 
35 generally plaque-purified, and preferably subjected to 
standard nucleic acid and protein analyses to verify its 
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identity and purity, and expression of the inserted 
polymerase. 

5.5. LARGE SCALE GROWTH AND PURIFICATION 

OF RECOMBINANT REPLI CABLE VESICULOVIRUSES 

5 

The recovered recombinant vesiculovirus, after 
plague-purification, can then be grown to large numbers, by 
way of example, as follows. Virus from a single plaque (-10 6 
pfu) is recovered and used to infect ~10 7 cells (e.g., BHK 

1Q cells), to yield, typically, 10 ml at a titer of 10 9 -10 10 
pfu/ml for a total of approximately 10 n pfu. Infection of 
~10 12 cells can then be carried out (with a multiplicity of 
infection of 0.1), and the cells can be grown in suspension 
culture, large dishes, or roller bottles by standard methods. 

15 It. is noted that recombinant vesiculoviruses which 

no longer express the extracellular region of the 
vesiculovirus G protein (which determine host range) and 
which, instead, express an envelope glycoprotein of a 
different virus will need to be grown in calls which are 

20 susceptible to infection by the different virus (and which 
cells thus express a receptor promoting infection by a virus 
expressing the envelope glycoprotein of the different virus) . 
Thus, for example, where the recombinant vesiculovirus 
expresses the HIV envelope glycoprotein, the virus is grown 

25 in CD4 + cells (e.g., CD4 + lymphoid cells). 

Virus for vaccine preparations can then be 
collected from culture supernatants , and the supernatants 
clarified to remove cellular debris. If desired, one method 
of isolating and concentrating the virus that can be employed 

30 is by passage of the supernatant through a tangential flow 
membrane concentration. The harvest can be further reduced 
in volume by pelleting through a glycerol cushion and by 
concentration on a sucrose step gradient. An alternate 
method of concentration is affinity column purification 

35 (Daniel et al. , 1988, Int. J. Cancer 41:601-608). However, 
other methods can also be used for purification (see, e.g., 
Arthur et al., 1986, J. Cell. Biochem. Suppl. 10A:226), and 
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any possible modifications of the above procedure will be 
readily recognized by one skilled in the art. Purification 
should be as gentle as possible, so as to maintain the 
integrity of the virus particle. 

5 

5.6. RECOMBINANT REPLI CABLE VESICULOVIRUSES 
FOR USE AS LIVE VACCINES 

In one embodiment of the invention, the recombinant 

replicable vesiculoviruses that express an immunogenic 

10 Antigen are used as live vaccines. 

The recombinant vesiculoviruses for use as 
therapeutic or prophylactic live vaccines according to the 
invention are preferably somewhat attenuated. Most available 
strains e.g., laboratory strains of VSV, may be sufficiently 

15 attenuated for use. Should additional attenuation be 

desired, e.g., based on pathogenicity testing in animals, 
attenuation is most preferably achieved simply by laboratory 
passage of the recombinant vesiculovirus (e.g., in BHK or any 
other suitable, cell line) . ^<suts± ;0 ly* attenuated viruses are 

20 obtainable by numerous methods known in the art including but 
not limited to chemical mutagenesis, genetic insertion, 
deletion (Miller, 1972, Experiments in Molecular Genetics, 

i 

Cold Spring Harbor Laboratory, Cold Spring Harbor, NY) or 
recombination using recombinant DNA methodology (Maniatis et 

25 al., 1982, Molecular Cloning, A Laboratory Manual, Cold 

Spring Harbor Laboratory, Cold Spring Harbor, NY), laboratory 
selection of natural mutants, etc. 

In this embodiment of the invention, a vaccine is 
formulated in which the immunogen is one or several 

30 recombinant vesiculovirus (es) , in which the foreign RNA in 
the genome directs the production of an Antigen in a host so 
as to elicit an immune (humoral and/or cell mediated) 
response in the host that is prophylactic or therapeutic. In 
an embodiment wherein the Antigen displays the antigenicity 

35 or immunogenicity of an antigen of a pathogen, administration 
of the vaccine is carried out to prevent or treat an 
infection by the pathogen and/or the resultant infectious 
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disorder and/or other undesirable correlates of infection. 
In an embodiment wherein the Antigen is a tumor antigen, 
administration of the vaccine is carried out to prevent or 
treat tumors (particularly, cancer). 
5 In a preferred specific embodiment, the recombinant 

vesiculoviruses are administered prophylactically , to 
prevent /protect against infection and/or infectious diseases 
or tumor (e.g., cancer) formation. 

In a specific embodiment directed to therapeutics, 

10 the recombinant vesiculoviruses of the invention, encoding 
immunogenic epitope (s) , are administered therapeutically, for 
the treatment of infection or tumor formation. 
Administration of such viruses, e.g., to neonates and other 
human subjects, can be used as a method of immunostimulation, 

15 to boost the host's immune system, enhancing cell-mediated 
and/or humoral immunity, and facilitating the clearance of 
infectious agents or tumors. The viruses of the invention 
can be administered alone or in combination with other 
•cherdpi"«o (examples of anti-viral therapies, including but 

20 not limited to a-interferon and vidarabine phosphate; 
examples of tumor therapy including but not limited to 
radiation and cancer chemotherapy) . 

5.7. INACTIVATED RECOMBINANT 
25 VESICULOVIRUSES FOR VACCINE USE 

In a specific embodiment, the recombinant 

replicable vesiculoviruses of the invention are inactivated 

(i.e., killed, rendered nonreplicable) prior to vaccine use, 

to provide a killed vaccine. Since the vesiculovirus 

30 envelope is highly immunogenic, in an embodiment wherein one 
or more foreign proteins (e.g., an envelope glycoprotein of a 
virus other than a vesiculovirus) is incorporated into the 
vesiculovirus envelope, such a virus, even in killed form, 
can be effective to provide an immune response against said 

35 foreign protein (s) in a host to which it is administered. In 
a specific embodiment, a multiplicity of Antigens, each 
displaying the immunogenicity or antigenicity of an envelope 
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glycoprotein of a different virus, are present in the 

recombinant vesiculovirus particle. 

The inactivated recombinant viruses of the 

invention differ from defective interfering particles in 
5 that, prior to inactivation the virus is replicable (i.e., it 

encodes all the vesiculovirus proteins necessary to enable it 

to replicate in an infected cell). Thus, since the virus is 

originally in a replicable state, it can be easily propagated 

and grown to large amounts prior to inactivation, to provide 
10 a large amount of killed virus for use in vaccines, or for 

purification of the expressed antigen for use in a subunit 

vaccine (see Section 5.8, infra). 

Various methods are known in the art and can be 

used to inactivate the recombinant replicable vesiculoviruses 
15 of the invention, for use as killed vaccines. Such methods 

include but are not limited to inactivation by use of 

formalin, betapropiolactone, gamma irradiation, and psoralen 

plus ultraviolet light. 

In a specific embodiment , r ^cabiriunt vesiculovirus 
2 0 can be readily inactivated by resuspension of purified 

virions in a suitable concentration of formaldehyde. While 
0.8 formaldehyde may be sufficient, verification of the 
optimum concentration of formaldehyde can be readily 
determined for a particular virus by titration of serial 
25 dilutions of formaldehyde with infectious virus to determine 
the inactivation curve of formalin for that virus. This 
technique has been described in detail by Salk and Gori, 
1960, Ann. N.Y. Acad. Sci. 83:609-637). By extrapolation to 
zero, the concentration expected to inactivate the last 
30 infectious particle can be estimated. By utilizing a 

substantially higher concentration, e.g., 4-fold greater than 
the estimated concentration, complete inactivation can be 
assured . 

Although formalin inactivation alone has proven to 
35 be effective, it may be desirable, for safety and regulatory 
purposes, to kill the virus twice or more, using one or more 
of the numerous other methods currently known for virus 
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inactivation. Thus, although not essential, it is 
contemplated that the virus used in the final formulation 
will be often inactivated by a second agent after treatment 
with formalin. 

5.8. USE OF RECOMBINANT REPLI CABLE 
VESICULOVIRUSES IN THE 
PRODUCTION OF SUBUNIT VACCINES 

Since the recombinant vesiculoviruses of the 
invention can be propagated and grown to large amounts, where 
the recombinant vesiculoviruses express an Antigen, growth of 
such vesiculoviruses provides a method for large scale 
production and ready purification of the expressed Antigen, 
particularly when the Antigen is incorporated into the 
envelope of the recombinant vesiculovirus. In a specific 
embodiment, the Antigen is all or a portion of an envelope 
glycoprotein of another virus, e.g., HIV gpl60, expressed as 
a nonfusion protein, or expressed as a fusion to the 
cytoplasmic domain of a vesiculovirus G protein. 

The Antigens thus produced and purified have use in 
subunit vaccines. 

The recombinant vesiculoviruses that express an 
Antigen can also be used to recombinantly produce the Antigen 
in infected cells in vitro, to provide a source of Antigen 
for use in immunoassays, e.g., to detect or measure in a 
sample of body fluid from a vaccinated subject the presence 
of antibodies to the Antigen, and thus to diagnose infection 
or the presence of a tumor and/or monitor immune response of 
the subject subsequent to vaccination. 

5.9. DETERMINATION OF VACCINE EFFICACY 
Immunopotency of the one or more Antigen (s) in its 

live or inactivated vesiculovirus vaccine formulation, or in 
is subunit vaccine formulation, can be determined by 
monitoring the immune response of test animals following 
immunization with the recombinant vesiculovirus (es) 
expressing the Antigen (s) or with the subunit vaccine 
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containing the Antigen, by use of any immunoassay known in 
the art. Generation of a humoral (antibody) response and/or 
cell-mediated immunity, may be taken as an indication of an 
immune response. Test animals may include mice, hamsters, 
5 dogs, cats, monkeys, rabbits, chimpanzees, etc., and 
eventually human subjects. 

Methods of introduction of the vaccine may include 
oral, intracerebral, intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, intranasal or any 
10 other standard routes of immunization. The immune response 
of the test subjects can be analyzed by various approaches 
such as: the reactivity of the resultant immune serum to the 
Antigen, as assayed by known techniques, e.g., enzyme linked 
immunosorbent assay (ELISA) , immunoblots , 
15 radioimmunoprecipitations, etc.; or, in the case where the 
Antigen displays the antigenicity or immunogenicity of a 
pathogen's antigen, by protection of immunized hosts from 
infection by the pathogen and/or attenuation of symptoms due 
to infection by the pathogen in inni^isai Lest.* ; or , in the 
20 case where the antigen displays the antigenicity or 

immunogenicity of a tumor antigen, by prevention of tumor 
formation or prevention of metastasis, or by regression, or 
by inhibition of tumor progression, in immunized hosts. 

As one example of suitable animal testing of a live 
25 vaccine, live vaccines of the invention may be tested in 

rabbits for the ability to induce an antibody response to the 
Antigens. Male specific-pathogen-free (SPF) young adult New 
Zealand White rabbits may be used. The test group of rabbits 
each receives approximately 5 x 10 8 pfu (plague forming units) 
30 of the vaccine. A control group of rabbits receives an 
injection in 1 mM Tris-HCl pH 9.0 of a non-recombinant 
vesiculovirus or of a recombinant vesiculovirus which does 
not express the same Antigen. 

Blood samples may be drawn from the rabbits every 
55 one or two weeks, and serum analyzed for antibodies to the 
Antigen(s). The presence of antibodies specific for the 
Antigen(s) may be assayed, e.g., using an ELISA. 
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Animals may also be used to test vaccine efficacy 
(e.g., challenge experiments). For example, in a specific 
embodiment regarding a live vaccine formulation, monkeys each 
receive intradermally approximately 5 x 10 8 pfu of recombinant 
5 vesiculovirus. A control monkey receives (control) non- 
recombinant virus intradermally. Blood is drawn weekly for 
12 weeks, and serum is analyzed for antibodies to the 
Antigen (s) . 

10 5.10. VACCINE FORMULATION AND ADMINISTRATION 

The vaccines of the invention may be multivalent or 
univalent. Multivalent vaccines are made from recombinant 
viruses that direct the expression of more than one Antigen, 
from the same or different recombinant viruses. 
15 Many methods may be used to introduce the vaccine 

formulations of the invention; these include but are not 
limited to oral, intradermal, intramuscular, intraperitoneal, 
intravenous, subcutaneous, intranasal routes, and via 
* scarification (scratching through chc. +r^p layers of skin, 
20 e.g., using a bifurcated needle). 

The patient to which the vaccine is administered is 
preferably a mammal, most preferably a human, but can also be 
a non-human animal including but not limited to cows, horses, 
sheep, pigs, fowl {e.g., chickens), goats, cats, dogs, 
25 hamsters, mice and rats. In the use of a live vesiculovirus 
vaccine, the patient can be any animal in which vesiculovirus 
replicates (for example, the above- listed animals). 

The virus vaccine formulations of the invention 
comprise an effective immunizing amount of one or more 
30 recombinant vesiculoviruses (live or inactivated, as the case 
may be) and a pharmaceutically acceptable carrier or 
excipient. Subunit vaccines comprise an effective immunizing 
amount of one or more Antigens and a pharmaceutically 
acceptable carrier or excipient. Pharmaceutically acceptable 
35 carriers are well known in the art and include but are not 
limited to saline, buffered saline, dextrose, water, 
glycerol, sterile isotonic aqueous buffer, and combinations 
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• thereof, one example of such an acceptable carrier is a 
physiologically balanced culture medium containing one or 
more stabilizing agents such as stabilized, hydrolyzed 
proteins, lactose, etc. The carrier is preferably sterile. 
S The formulation should suit the mode of administration. 

The composition, if desired, can also contain minor 
amounts of wetting or emulsifying agents, or pH buffering 
agents. The composition can be a liquid solution, 
suspension, emulsion, tablet, pill, capsule, sustained 
10 release formulation, or powder. Oral formulation can include 
standard carriers such as pharmaceutical grades of mannitol, 
lactose, starch, magnesium stearate, sodium saccharine, 
cellulose, magnesium carbonate, etc. 

Generally, the ingredients are supplied either 
15 separately or mixed together in unit dosage form, for 
example, as a dry lyophilized powder or water free 
concentrate in a hermetically sealed container such as an 
ampoule or sachette indicating the quantity of active agent. 
Where the *ww^s;ition is administered by injection, an 
20 ampoule of sterile diluent can be provided so that the 
ingredients may be mixed prior to administration. 

In a specific embodiment, a lyophilized recombinant 
vesiculovirus of the invention is provided in a first 
container; a second container comprises diluent consisting of 
25 an aqueous solution of 50% glycerin, 0.25% phenol, and an 
antiseptic (e.g., 0.005% brilliant green). 

The precise dose of virus, or subunit vaccine, to 
be employed in the formulation will also depend on the route 
of administration, and the nature of the patient, and should 
30 be decided according to the judgment of the practitioner and 
each patient's circumstances according to standard clinical 
techniques. An effective immunizing amount is that amount 
sufficient to produce an immune response to the Antigen in 
the host to which the recombinant vesiculovirus, or subunit 
35 vaccine, is administered. 

In a specific embodiment, an effective immunizing 
amount of a live recombinant vesiculovirus of the present 
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invention is within the range of 10 3 to 10 9 pfu/dose, more 
preferably 10 6 to 10 9 pfu/dose. Boosting is possible but not 
preferred. If boosting is desired, one optionally may boost 
with the Antigen in purified form rather than using a 
5 recombinant vesiculovirus of the invention. 

For inactivated recombinant vesiculovirus vaccines, 
the vaccine formulation comprises an effective immunizing 
amount of the inactivated virus, preferably in combination 
with an immunostimulant; and a pharmaceutical^ acceptable 

10 carrier. As used in the present context, " immunostimulant » 
is intended to encompass any compound or composition which 
has the ability to enhance the activity of the immune system, 
whether it be a specific potentiating effect in combination 
with a specific antigen, or simply an independent effect upon 

15 the activity of one or more elements of the immune response. 
Some of the more commonly utilized immunostimulant compounds 
in vaccine compositions are the adjuvants alum or muramyl 
dipeptide (MDP) and its analogues. Methods of utilizing 
these luateriais are known in the art, ani/ft^is- well within 

20 the ability of the skilled artisan to determine an optimum 
amount of stimulant for a given virus vaccine. It may also 
be desired to use more than one immunostimulant in a given 
formulation. 

The exact amount of inactivated virus utilized in a 
25 given preparation is not critical, provided that the minimum 
amount of virus necessary to provoke an immune response is 
given. A dosage range of as little as about 10 ng, up to 
amount a milligram or more, is contemplated. As one example, 
in a specific embodiment, individual dosages may range from 
30 about 50-650 jig P^r immunization. 

Use of purified Antigens as subunit vaccines can be 
carried out by standard methods. For example, the purified 
protein (s) should be adjusted to an appropriate 
concentration, formulated with any suitable vaccine adjuvant 
35 and packaged for use. Suitable adjuvants may include, but 
are not limited to: mineral gels, e.g., aluminum hydroxide; 
surface active substances such as lysolecithin, pluronic 
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polyols; polyanions; peptides; oil emulsions; alum, and MDP. 
The immunogen may also be incorporated into liposomes, or 
conjugated to polysaccharides and/or other polymers for use 
in a vaccine formulation. In instances where the recombinant 
5 Antigen is a hapten, i.e., a molecule that is antigenic in 
that it can react selectively with cognate antibodies, but 
not immunogenic in that it cannot elicit an immune response, 
the hapten may be covalently bound to a carrier or 
immunogenic molecule; for instance, a large protein such as 
10 serum albumin will confer immunogenicity to the hapten 

coupled to it. The hapten-carrier may be formulated for use 
as a vaccine. 

Effective doses, (immunizing amounts) of the 
vaccines of the invention may also be extrapolated from dose- 

15 response curves derived from animal model test systems. 

The invention also provides a pharmaceutical pack 
or kit comprising one or more containers comprising one or 
more of the ingredients of the vaccine formulations of the 
invention./ Associated with such container (s) can be a notice 

2 0 in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

The present invention thus provides a method of 

25 immunizing an animal, or treating or preventing various 

diseases or disorders in an animal, comprising administering 
to the animal an effective immunizing dose of a vaccine of 
the present invention. 

30 5.11. USE OF ANTIBODIES GENERATED 

BY THE VACCINES OF THE INVENTION 

The antibodies generated against the Antigen by 

immunization with the recombinant viruses of the present 

invention also have potential uses in diagnostic 

35 immunoassays, passive immunotherapy, and generation of 

antiidiotypic antibodies. 
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The generated antibodies may be isolated by 
standard techniques known in the art (e .g. , immunoaf f inity 
chromatography, centrif ugation, precipitation, etc.) and used 
in diagnostic immunoassays. The antibodies may also be used 
5 to monitor treatment and/or disease progression. Any 
immunoassay system known in the art, such as those listed 

supra, may be used for this purpose including but not limited 

» 

to competitive and noncompetitive assay systems using 
techniques such as radioimmunoassays, ELISA (enzyme- linked 

10 immunosorbent assays), "sandwich" immunoassays, precipitin 
reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, complement- 
fixation assays, immunoradiometric assays, fluorescent 
immunoassays, protein A immunoassays and 

15 Immunoelectrophoresis assays, to name but a few. 

The vaccine formulations of the present invention 
can also be used to produce antibodies for use in passive 
immunotherapy, in which short-term protection of a host is 
achieved by che administration of pre-for:*£3. antibody 

2 0 directed against a heterologous organism. 

The antibodies generated by the vaccine 
formulations of the present invention can also' be used in the 
production of antiidiotypic antibody. The antiidiotypic 
antibody can then in turn be used for immunization, in order 

25 to produce a subpopulation of antibodies that bind the 

initial antigen of the pathogenic microorganism (Jerne, 1974, 
Ann. Immunol. (Paris) I25c:373; Jerne, et al., 1982, EMBO J. 
1:234). 

30 6. RECOMBINANT VESICULAR STOMATITIS 

VIRUSES FROM DNA 

We assembled a DNA clone containing the 11,161 

nucleotide sequence of the prototype rhabdovirus, vesicular 

stomatitis virus (VSV) , such that it could be transcribed by 

35 the bacteriophage T7 RNA polymerase to yield a full-length 

positive strand RNA complementary to the VSV genome. 

Expression of this RNA in cells also expressing the VSV 
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nucleocapsid protein and the two VSV polymerase subunits 
resulted in production of VSV with the growth 
characteristics of wild-type VSV. Recovery of virus from DNA 
was verified by: 1) the presence of two genetic tags 
5 generating novel restriction sites in DNA derived from the 
genome; 2) direct sequencing of the genomic RNA of the 
recovered virus, and 3), production of a VSV recombinant in 
which the glycoprotein was derived from a second serotype. 
The ability to generate VSV from DNA opens numerous 

10 possibilities for the genetic analysis of VSV replication. 
In addition, because VSV can be grown to very high titers and 
in large quantities with relative ease, one can genetically 
engineer recombinant VSVs displaying novel antigens. Such 
modified viruses can be used as vaccines conferring 

15 protection against other viruses or pathogenic 

microorganisms, or to produce immunity in general against an 
encoded foreign antigen. 

• - g - Ivv W.TERIALS AND METHODS 

20 Plasmid Construction. The plasmid pVSVFL(+) 

expressing the 11,161 nucleotide positive strand 
(antigenomic) VSV RNA sequence was constructed from four DNA 
fragments cloned into pBluescript SK + (Stratagene) . The 
starting plasmid for the construction, pVSVFL(-) , expressed 

25 the complete negative sense VSV genomic RNA (Indiana 

serotype) from a T7 promoter. This plasmid was generated in 
a nine step cloning procedure that involved joining the five 
original cDNA clones of the VSV mRNAs (Gallione et al., 1981, 
J. Virol. 39:529-535; Rose and Gallione, 1981, J. Virol. 

30 39:519-528; Schubert et al. , 1985, Proc. Natl. Acad. Sci. USA 
82:7984-7988) with gene junction fragments and terminal 
fragments. These fragments were generated by reverse 
transcription and polymerase chain reaction (RT-PCR) (Mullis 
and Faloona, 1987, Methods in Enzymology 155:33 5-3 50) from 

35 VSV genomic RNA (M.A. Whitt, R. Burdine, E.A. Stillman and 
J.K. Rose, manuscript in preparation). To facilitate 
engineering of the VSV genome and to provide genetic tags, 
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unique Mlu I and Nhe I restriction enzyme sites were 
introduced by oligonucleotide-directed mutagenesis into the 
5' and 3' non-coding regions flanking the VSV glycoprotein 
gene prior to construction of the full length genome. 
5 In the initial s step of constructing pVSVFL(+) we 

used the primers 

( 5 ' CCGGCTCGAG TTGTAATACGACTCACTATAGGG ACGAAGACAAACAAACCATTATTAT 
C-3') (SEQ ID NO: 38) and ( 5 ' GAACTCTCCTCTAGATGAGAAC— 3 ' ) (SEQ 
ID NO: 39) to amplify (Mullis and Faloona, 1987, Methods in 
10 Enzymology 155:33 5-350) a 2,124 nucleotide fragment from 

» 

pVSVFL(-) (# 1, Fig. 4 A) . This fragment corresponds to the 
3' end of the VSV genome. The first primer introduced an Xho 
I site and a T7 promoter (underlined) immediately preceding 
the sequence complementary to the 3' end of the VSV genome. 

15 The second primer covered a unique Xba I site present in the 
VSV P gene. The PCR product was digested with Xho I and Xba 
I and cloned into pBluescript SK + (Stratagene) that had been 
digested with Xho I and Xba I. The resulting plasmid 
carrying the sequence corresponding to the 3^ end: -of "Lhe VSV 

20 genome preceded by a T7 promoter was designated pBSXX. Note 
that an additional T7 promoter is also present upstream of 
the Xho I site in the vector. Next we generated the sequence 
corresponding to the 5' end of the VSV genome and part of the 
hepatitis delta virus (HDV) ribozyme (Pattnaik et al., 1992, 

25 Cell 69:1011-1120; Perrotta and Been, 1991, Nature 

350:434-436) . A 147 nucleotide PCR product (#3, Fig. 4A) was 
amplified from pVSVFL(-) with primers 

( 5 ' AGGTCGGACCGCGAGGAGGTGGAGATGCCATGCCGACCC ACGAAGACCACAAAACCAG 
•3') (SEQ ID NO: 4 0) and (5' ATGTTGAAGAGTGACCTACACG-3 ' ) (SEQ ID 

30 NO: 41). The first primer contained 39 nucleotides of the 
sequence encoding the HDV ribozyme (underlined) followed by 
19 nucleotides complementary to the 3' end of the VSV 
antigenomic RNA. The second primer hybridized within the L 
gene (Fig. 4 A) . The PCR product was digested with Af 1 II and 

35 Rsr II and the 80 nucleotide Afl II-Rsr II fragment was 

ligated to a 225 nucleotide Rsr II-Sac I fragment (#4, Fig. 
4A) derived from a plasmid designated pBS-GMG (Stillman et 
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al., manuscript submitted). Fragment 4 contained the T7 
terminator sequence and the remainder of the sequence 
encoding the HDV ribozyme. Ligated products were digested 
with Afl II and Sac I and the 305 nucleotide Afl II-Sac I 
5 product was cloned into the Afl II and Sac I sites of a 

modified pBSXX vector that contained an Afl II site inserted 
at the unique Not I site within the poly linker. This plasmid 
containing the Afl II-Sac I fragment was designated pBXXAS. 
To complete the construction, a 10,077 nucleotide Bst 1107 I 

10 to Afl II fragment (#2, Fig. 4A) containing 90% of the VSV 
sequences from pVSVFL(-) was inserted into the unique Bst 
1107 I and Aflll sites of pBXXAS. The final plasmid was 
designated pVSVFL(-f) . The sequences in this plasmid 
generated by PCR (hatched sequences, Fig. 4B) were determined 

15 and contained no errors. We also prepared a plasmid in which 
the sequence of the VSV Indiana serotype G gene (Mlul-Nhel) 
was replaced with the G gene from the New Jersey serotype of 
VSV (Gallione and Rose, 1983, J. Virol. 46:162-169). This 
plasmid is called pwyFL{+) WJC and has only a single T7 

20 promoter. 

Transf ection and recovery of recombinant VSV. Baby 
hamster kidney cells (BHK-21, ATCC) were maintained in DME 
(Dulbecco's modified Eagle's medium) supplemented with 5% 
fetal bovine serum (FBS) . Cells on 10 cm dishes ("70% 

25 confluent) were infected at a multiplicity of 10 with VTF7-3 
(Fuerst et al., 1986, Proc. Natl. Acad. Sci. USA 
83:8122-8126). After 30 min, plasmids encoding the VSV 
antigenomic RNA and the N, P, and L proteins were transf ected 
into the cells using a calcium phosphate transfection kit 

30 according to directions supplied (Stratagene) . The coding 
regions for N, P, and L proteins were each expressed in 
pBluescript SK(+) from the T7 promoter. Plasmid amounts were 
10 M9 pVSVFL( + ) , 5 fxq pBS-N, 4 fxg pBS-P, and 2 pBS-L. 
After 24-48 h incubation at 37°C in 3% C0 2 , cells were 

35 scraped from the dish and subjected to three rounds of 
freeze-thawing (-70°C, 37°C) to release cell-associated 
virus. Debris was pelleted from the cell lysates by 
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centrifugal: ion at 1,250 x g for 5 min. Five ml of this 
lysate was added to approximately 10 6 BHK cells on a 10 cm 
plate in 10 ml of DME + 5% FBS. After 48 h the medium was 
clarified by centrif ugation at 1,250 x g for 10 min, and 
5 passed through a filter to remove the majority of the 

vaccinia virus (0.2 /Ltm pore size, Gelman Sciences). One ml 
was then added directly to BHK cells that had been plated on 
a coverslip in a 3 5 mm dish. After four hours, the cells 
were fixed in 3% paraformaldehyde and stained with monoclonal 

10 antibody II to the VSV Gj protein (Lefrancois and Lyles, 1982, 
Virology 121:168-174) or 9B5 (Bricker et al. f 1987, Virology 
161:533-540) to the VSV G NJ protein followed by goat anti-mouse 
rhodamine conjugated antibody (Jackson Research) . Cells were 
then examined by indirect immunofluorescence using a Nikon 

15 Microphot-FX microscope equipped with a 4 Ox planapochromat 
objective. When VSV recovery was successful, 100% of the 
cells showed the typical bright stain for G protein 
characteristic . of a VSV infection. 

Preparation ana analysis of VSV RNA and pzc'Zziz*'. 

20 Recombinant VSV and wild-type VSV isolated from single 
plaques (~10 5 plague forming units) were used to infect a 
monolayer of BHK cells (-80% confluent) on a 10 cm dish in 
10 ml DME plus 5% FBS. After 24 h, cell debris and nuclei 
were removed by centrifugation at 1,250 x g for 5 min, and 

25 virus was then pelleted from the medium at 35,000 RPM in a 
Beckman SW41 rotor for one hour. Virus pellets were 
resuspended in 0.5 ml 10 mM Tris-HCl, pH 7.4 for protein 
analysis. For RNA isolation, virus was resuspended in 0.2 ml 
of 0.5% SDS/0.2M sodium acetate, pH 8.0, followed by 

30 extraction with phenol/CHCl 3 . RNA was precipitated with 95% 
ethanol and 5 /tig carrier tRNA. RNA was pelleted by 
centrifugation at 12,000 x g for 15 min and resuspended in 
water with 1 unit RNasin (Promega) . For analysis of RNA by 
RT-PCR, primer pairs flanking either the novel Nhe I or Mlu I 

35 sites were used. The first strand DNA synthesis reaction was 
carried out in 50 jxl of PCR buffer (Promega) containing 5 mM 
MgCl 2 , 1 mM dNTPs, 1 unit RNAs in (Promega), 1 unit avian 
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myeloblastosis virus reverse transcriptase (AMV RT; Promega) 
0.75 /xM primer and approximately 0.25 /ig of VSV genomic RNA. 
Incubation was at 42 °C for 15 min followed by 5 min at 99°C 
and 5 min at 5°C. PCR was carried out by addition of 0.5 U 
5 Taq polymerase, adjustment of MgCl 2 concentration to 1.25 mM , . 
and addition of the second primer (0.75 ^M) . The reaction 
was subjected to 20 thermal cycles: 95°C, 1 min; 60°C 1.5 
min. The reaction was then incubated at 60 °C for 7 min. 

Direct sequencing of VSV genomic RNA was performed 
•10 according to a previously described protocol based on the 
dideoxy chain termination method (Mierendorf and Pfeffer, 
1987, Methods in Enzymology 152:563-566) except that 
[a- 33 P]dATP (Amersham, Inc.) was used. Each reaction included 
approximately 0.25 fig of VSV genomic RNA. 

■ 

6.2 . RESULTS 
To construct a cDNA clone encoding the entire 
11,161 VSV genome, individual cDNA clones of the VSV mRNAs 

- were initially joined ^-ir? email DNA fragments generated by 

20 RT-PCR that covered the four gene junctions. Correct genomic 
terminal sequences were also generated by RT-PCR of the VSV 
genome, and these were joined to the other DNAs using 
restriction sites. This initial clone was constructed with a 
T7 promoter directing synthesis of the full-length negative 

25 strand VSV RNA. Despite numerous attempts, we were unable to 
recover VSV from cells expressing the VSV genomic RNA and the 
VSV N, P, and L proteins. The VSV constructed was thus 
redesigned to express the VSV antigenomic DNA. The 
construction strategy is described in Materials and Methods 

30 and in Fig. 4A-B. The entire VSV sequence as well as a T7 
promoter, terminator and HDV ribozyme sequence were cloned in 
pBluescript SK+ between the Xho I and Sac I sites (Fig. 4B; 
Fig. l) . An additional T7 promoter is also present upstream 
of the Xho I site in the plasmid. A slightly different 

35 cloning strategy was used to generate plasmids lacking the 
upstream T7 promoter and VSV has also been recovered from 
these constructs. 
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Recovery of VSV from DNA. To determine if we could 
recover VSV from plasmid DNA, we infected cells with vaccinia 
vTF7-3 (Fuerst et al., 1986, Proc. Natl. Acad. Sci. USA 
83:8122-8126) to provide cytoplasmic T7 RNA polymerase. 
5 These cells were then transfected with pVSVFL(+) , which 
expresses the antigenomic VSV RNA from a T7 promoter, and 
three other plasmids which express the VSV N, P, and L 
proteins. Expression of the N protein was required to 
assemble nascent VSV antigenomic RNA into nucleocapsids. 
10 Once formed, these nucleocapsids should serve as templates 
for synthesis of minus strand RNA by the L/P polymerase 
complex. Encapsidated minus strand RNA should then be a 
template for transcription, initiating the VSV infectious 
cycle. 

15 The initial recovery experiment employed two 10 cm 

plates of BHK cells (~5 x 10 6 cells each) . At 24 hours after 
the infection with VTF7-3 and transfection with the four 
plasmids, cells and medium were frozen and thawed to release 
any cell-associated VSV, and the clarified lysates trsre. *dde- 

20 to fresh BHK cells. After 48 hours, both plates showed 

severe cytopathic effects that could have been due either to 
vaccinia virus or to recovered VSV. One ml of each 
supernatant was then added to small dishes of BHK cells on 
coverslips. After two hours, one of these coverslips showed 

25 rounded cells characteristic of a VSV infection, while the 
other did not. After 4 hours, cells on both coverslips were 
fixed , stained with appropriate antibodies, and examined by 
indirect immunofluorescence microscopy to detect the VSV G 
protein. All cells on the coverslip showing rounded cells 

30 revealed intense fluorescence characteristic of G protein 

expression during VSV infection (data not shown) . Subsequent 
passaging and analysis described below showed that VSV had 
been recovered from the transfection. The other coverslip 
showed no G expression, and no VSV could be recovered after 

35 passaging. 

Based on the frequency with which rabies virus 
(Schnell et al., 1994, EMBO J. 13:4195-4203) and VSV 
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minigenomes (Stillman et al. # manuscript submitted) were 
recovered, we anticipated that recovery of complete VSV, if 
obtainable, would be a rare event. The initial recovery of 
VSV from only one of two transf ect ions suggested the 
5 possibility that the initial titer in the positive lysate was 
very low. To examine this titer, we infected BHK cells on 
coverslips with one tenth of the lysate (1 ml) derived from 
each initial transf ection. After eight hours, the cells were 
examined for expression of G protein by indirect 

10 immunofluorescence. A scan of the entire coverslip revealed 
no VSV infection from the negative lysate, and only five 
small areas of infection (2-6 cells each) from the lysate 
that gave rise to VSV G expression on subsequent passaging. 
The initial titer was therefore very low as we suspected, and 

15 likely represented a total of about 50 infectious particles, 
probably derived from a VSV infection initiated in only one 
cell out of 2 x 10 7 transf ected. This low rate of recovery of 
infectious VSV is typical of that observed in several 
'' experiments. - • 

2 0 Analysis of viral proteins. Subsequent passages 

and plague assays of VSV recovered in three independent 
experiments revealed plaques that were detectable in less 
than 16 hours and titers up to 2 x 10 9 pfu/ml characteristic 
of VSV. For further verification that VSV had been 

25 recovered, the proteins in virus pelleted from the medium 
were examined by SDS polyacrylamide gel electrophoresis 
(PAGE) . Fig. 5 shows the Coomassie stained gel of proteins 
of VSV recovered from recombinant DNA (rVSV) and wildtype 
VSV. The mobilities and relative amounts of the five viral 

30 proteins were indistinguishable in the wildtype and 
recombinant virus. 

Identification of sequence tags. In pVSVFL(-f) , the 
VSV nucleotide sequence was altered by oligonucleotide- 
directed mutagenesis to generate unique Mlu I and Nhe I 

35 restriction enzyme sites in the 5' and 3' non-coding regions 
of the glycoprotein gene. To verify that these sites were 
present in recovered virus, we carried out reverse 
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transcription of genomic RNA purified from wild-type or 
recombinant virions using primers upstream of each 
restriction site. The reverse transcription products were 
then amplified by PCR using an additional primer downstream 
5 of each restriction site. The presence of the genetic tag in 
the recombinant virus was verified by digestion of the PCR 
products with the appropriate restriction enzymes. Using 
this method, the presence of both the Mlu I and Nhe I 
sequences in the recovered virus RNA was verified, and the 

10 results for the Nhe I site are shown in Fig. 6. Sequences 
from wild-type VSV and recombinant VSV were amplified in 
parallel and a 620 nucleotide fragment was obtained in both 
cases (lanes 3 and 5) . No product was obtained when reverse 
transcriptase was omitted from the reactions prior to PCR 

15 (lanes 1 and 2), indicating that the PCR product was derived 
from RNA, not from contaminating DNA. After digestion with 
Nhe I, expected fragments of 273 and 347 base pairs were 
obtained from recombinant VSV RNA, while the DNA derived from 
the wildtype RNA remained audi g t^ted (lanes 4 and 6) . 

20 Direct sequencing of tagged genomic RNA. The 

presence of new restriction sites in the DNA generated by PCR 
provided strong evidence that VSV had been recovered from 
DNA. To ensure that identification of the genetic tags by 
PCR had not resulted from inadvertent contamination by 

25 plasmid DNA, we carried out direct sequence analysis of the 
genomic RNA using reverse transcriptase and a primer 
hybridizing upstream of the Nhe I site. The sequence from 
the autoradiogram shown in Fig. 7 is in exact agreement with 
the published sequence of the VSV G mRNA (Rose and Gallione, 

30 1981, J. Virol. 39:519-528) except that the four nucleotide 
changes used to generate the Nhe I site (GCACAA to GCTAGC) 
are present. These results show unequivocally that the 
sequence tag is present in the genomic RNA. 

Recombinant VSV Indiana virus carrying the 

35 glycoprotein of the New Jersey serotype. There are two 
serotypes of VSV designated Indiana and New Jersey. The 
glycoproteins of the two serotypes share approximately 50% 
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sequence identity (Gallione and Rose, 1983, J. Virol. 
46:162-169). In earlier studies we found that the 
glycoprotein of the New Jersey serotype could complement a 
mutant of the VSV, serotype that makes a defective 
5 glycoprotein (Whitt et al., 1989, J. Virol. 63:3569-3578). 
It therefore seemed likely that a recombinant VSV in which 
the Indiana glycoprotein (Gj) gene was replaced by the New 
Jersey glycoprotein (G^) gene would be viable despite the 
extensive sequence divergence. To generate such a 

10 recombinant, the G NJ cDNA was amplified by PCR using primers 
that introduced Mlu I and Nhe I sites within the 5' and 3' 
non-coding regions at each end of the gene. The amplified 
DNA was cloned into pBluescript and the G NJ protein was 
expressed in BHK cells using the vaccinia-T7 system. The 

15 protein expressed was shown to have membrane fusion activity 
below pH 6.0 indicating that it was functional (data not 
shown). This G NJ cDNA was then cloned into the unique Mlu I 
and Nhe I sites of the full-length construct after removal of 
sequences encoding Gj Recombinant VEV was recovered 

20 essentially as described above except that the initial 

transfectipn was allowed to proceed for 48 hours before the 
freeze-thaw step. After the first passage, expression of the 
Gnj protein was verified by indirect immunofluorescence using 
a monoclonal antibody specific to G w (Bricker et al., 1987, 

25 Virology 161:533-540). The virus was then plaque purified 
and grown. To examine the proteins present in the 
recombinant virus, virus recovered from cells infected with 
VSVj , VSV NJ , and the recombinant VSV^c was analyzed by SDS- 
PAGE followed by Coomassie staining. The VSVj G, N, P, and M 

30 proteins each have mobilities distinct from their VSV NJ 

counterparts (Fig. 8, lanes 1 and 3). The recombinant VSV^o 
shows the mobility difference in only the G protein as 
expected (lane 2) . The presence of the novel Nhe I and Mlu I 
sites in the recombinant was also verified (data not shown) . 
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6.3. DISCUSSION 

The results presented here establish that 
infectious VSV can be recovered from recombinant DNA. We 
believe that expressing the positive strand, antigenomic RNA 
5 in the presence of the N, P and L proteins was critical to 
our success because we have not recovered virus starting with 
an equivalent construct encoding the genomic RNA. 

Why is the initial event of generating VSV so rare, 
apparently occurring in only 1 in 10 7 to 10 8 transfected cells? 

10 One possibility is that our clone contains a sequence error 
that is only corrected by a rare mutational event. We 
believe this is not the case because the clone was completely 
sequenced prior to assembly and differences from published 
sequences were corrected, or the proteins were shown to be 

15 functional in complementation assays. Also, the frequency of 
recovery is actually higher than expected based on our 
observations with minigenomes encoding one or two VSV 
proteins (Stillman et al., manuscript submitted). In these 
cases we found that a transcribing ~nd replicating minigenc~e 

20 (~2kb RNA) was recovered in about 1 in 10 2 transfected cells 
expressing the RNA with the N,P and L proteins. Addition of 
a second cistron (0.85 kb additional RNA) encoding the M 
protein dropped the recovery rate to approximately 1 in 10 3 
transfected cells. If there is a ten-fold drop in recovery 

25 rate for each additional kilobase of RNA added, one can 

easily rationalize an even lower frequency of recovery for 
the 11, 161 kb genome than we observed. Although these 
minigenomes encode negative sense RNAs, the comparison of the 
frequency of recovery to that of the full length plus 

30 construct is probably valid because expression of the N, P 
and L mRNAs would not generate mRNAs complementary to the 
minigenome. 

Although the rate limiting step in generation of 
infectious VSV is not known, it is likely to be at the level 
35 of synthesis and encapsidation of the large antigenomic RNA, 
which must occur prior to replication and transcription. The 
complete encapsidation with N protein probably has to occur 
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on the nascent RNA to protect it from degradation, and the 
cells in which this occurs must also produce appropriate 
amounts of L and P proteins to initiate replication. Once 
this has occurred, however, the transcription and translation 
5 of the genome should generate additional N, P, and L proteins 
as well as the G and M proteins required for budding of 
infectious virus. 

The recovery of VSV from DNA opens numerous aspects 
of the viral life cycle to genetic analysis. The studies of 

10 the genetic signals involved in transcription and replication 
have so far been confined to analysis of defective RNAs that 
do not encode viral proteins (Pattnaik et al., 1992, Cell 
69:1011^1120; Wertz et al., 1994, Proc. Natl. Acad. Sci. USA 
91:8587-8591). These and other signals can be now examined 

15 in the context of a VSV infection occurring in the absence of 
a vaccinia virus infection. The system we have described 
also provides an opportunity to study the roles of individual 
viral protein domains and modifications in viral assembly and 
ie^ixcacion. Previously these an^lysp c have been confined to 

20 in vitro systems or to analysis employing the complementation 
of naturally occurring mutants where synthesis of the mutant 
protein can complicate the analysis. 

Perhaps even more exciting is the ability to use 
VSV as a vector to express other proteins. The experiment in 

25 which we recovered VSV Indiana carrying the glycoprotein from 
the New Jersey serotype (Fig. 8) illustrates that viable 
recombinants can be made. For reasons that are unclear the 
titers of recombinant virus were at least ten-fold lower than 
those obtained with either parent. The lower titer apparently 

30 did not result from a defect in viral assembly because the 
amounts of proteins in wildtype and recombinant virions at 
the end of the infection were comparable (Fig. 8) . Our 
previous experiments showed that a foreign glycoprotein 
carrying the appropriate cytoplasmic tail signal could be 

35 incorporated into the VSV envelope (Owens and Rose, 1993, J. 
Virol. 67:360-365). This suggests that one may generate 
recombinant VSVs carrying novel proteins in their envelopes. 

- 70 - 



BNSDOCID: <WO. 



,9634625A1JA> 



WO 96/34625 



PCT/US96/06053 



If these were appropriately attenuated, they can be used as 
vaccines against other viral diseases. 

The truncated genomes of defective interfering 
particles are replicated and packaged very well, thus we 
5 suspect that there will be flexibility in the maximum length 
of the genome that can be packaged as well. Presumably a 
longer nucleocapsid can be packaged as a longer bullet-shaped 
particle. Because of the modular nature of the VSV genome, 
with conserved gene end and start sequences at the gene 
10 junctions (Rose and Schubert, 1987, in The Viruses: The 

Rhabdoviruses, Plenum Publishing Corp., NY, pp. 129-166), it 
should be relatively easy to engineer additional genes into 
VSV. 

15 7. DEPOSIT OF MICROORGANISMS 

Plasroid pVSVFL(-f) was deposited on May 2, 1995 with 
the American Type Culture Collection (ATCC) , 1201 Parklawn 
Drive, Rockville, Maryland 20852, under the provisions of the 
Budapest Treaty on the International Recognition of tlii 
20 Deposit of Microorganisms for the Purposes of Patent 
Procedures, and assigned accession no. 97134. 

The present invention is not to be limited in scope 
by the microorganism deposited or the specific embodiments 

25 described herein. Indeed, various modifications of the 

invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing 
description and accompanying figures. Such modifications are 
intended to fall within the scope of the appended claims. 

30 Various publications are cited herein, the 

disclosures of which are incorporated by reference in their 
entireties . 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Rose, John K. 

(ii) TITLE OF INVENTION: RECOMBINANT VESICULOVIRUSES AND THEIR 
USES 

s 

(iii) NUMBER OF SEQUENCES: 41 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PENNIE & EDMONDS 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10036-2711 

» 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER : IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: On Even Date Herewith 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION- 

(A) NAME: Misrock, S. Lesiie 

(B) REGISTRATION NUMBER lb,o~*2 

(C) REFERENCE /DOCKET NUMBER^ 6523-009-228 

(ix) TELECOMMUNICATION INFORMATION • 

(A) TELEPHONE: (212) 790-9090 

(B) TELEFAX: (212) 869-9741/8864 

(C) TELEX: 66141 PENNIE 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 311 base pairs 
.(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 760.. 2025 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2092.. 2886 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 2946.. 3632 

» 

(ix) FEATURE: « 
(A) NAME /KEY : CDS 
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(B) LOCATION: 3774.. 5306 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

<B) LOCATION: 5429.. 11755 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 



CACCTAAATT 


GTAAGCGTTA 


ATATTTTGTT 


AAAATTCGCG 


TTAAATTTTT 


G TTAAATCAG 


60 

w w 


CTCATTTTTT 


AACCAATAGG 


CCGAAATCGG 


CAAAATCCCT 


TATAAATCAA 


AAGAATAGAC 


120 


CGAGATAGGG 


TTGAGTGTTG 


TTCCAGTTTG 


GAACAAGAGT 


CCACTATTAA 


AGAACGTGGA 


180 


CTCCAACGTC 


AAAGGGCGAA 


AAACCGTCTA 


TCAGGGCGAT 


GGCCCACTAC 


GTGAACCATC 


240 


ACCCTAATCA 


AGTTTTTTGG 


GGTCGAGGTG 


CCGTAAAGCA 


CTAAATCGGA 


ACCCTAAAGG 


300 


GAGCCCCCGA 


TTTAGAGCTT 


GACGGGGAAA 


GCCGGCGAAC 


GTGGCGAGAA 


AGGAAGGGAA 


360 


GAAAGCGAAA 


GGAGCGGGCG 


CTAGGGCGCT 


GGCAAGTGTA 


GCGGTCACGC 


TGCGCGTAAC 


420 


CACCACACCC 


GCCGCGCTTA 


ATGCGCCGCT 


ACAGGGCGCG 


TCCCATTCGC 


CATTCAGGCT 


480 


GCGCAACTGT 


TGGGAAGGGC 


GATCGGTGCG 


GGCCTCTTCG 


CTATTACGCC 


AGCTGGCGAA 


540 


AGGGGGATGT 


GCTGCAAGGC 


GATTAAG TTG 


GGTAACGCCA 


GGGTTTTCCC 


AGTCACGACG 


600 


TTGTAAAACG 


ACGGCCAGTG 


AATTGTAATA 


CGACTCACTA 


TAGGGCGAAT 


TGGGTACCGG 


660 


GCCCCCCCTC 


GAGTTGTAAT 


ACGACTCACT 


ATAGGGACGA 


AGACAAACAA 


ACCATTATTA 


720 


T JATTAAAAG, CCTCAGGAGA AACTTTAACA 


GTAATCAAA ATG TCT GTT 


ACA GTC 


77 i 



Met Ser Val Thr Val 
1 5 

AAG AGA ATC ATT GAC AAC ACA GTC ATA GTT CCA AAA CTT CCT GCA AAT 822 
Lys Arg lie lie Asp Asn Thr Val lie Val Pro Lys Leu Pro Ala Asn 

• 10 15 20 

GAG GAT CCA GTG GAA TAC CCG GCA GAT TAC TTC AGA AAA TCA AAG GAG 870 
Glu Asp Pro Val Glu Tyr Pro Ala Asp Tyr Phe Arg Lys Ser Lys Glu 

25 30 35 

ATT CCT CTT TAC ATC AAT ACT ACA AAA AGT TTG TCA GAT CTA AGA GGA 918 
lie Pro Leu Tyr He Asn Thr Thr Lys Ser Leu Ser Asp Leu Arq Glv 
40 45 50 

TAT GTC TAC CAA GGC CTC AAA TCC GGA AAT GTA TCA ATC ATA CAT GTC 966 
Tyr Val Tyr Gin Gly Leu Lys Ser Gly Asn Val Ser He He His Val 
55 60 65 

AAC AGC TAC TTG TAT GGA GCA TTA AAG GAC ATC CGG GGT AAG TTG GAT 1014 
Asn Ser Tyr Leu Tyr Gly Ala Leu Lys Asp He Arg Gly Lys Leu Asp 
7 ° 75 80 85 

AAA GAT TGG TCA AGT TTC GGA ATA AAC ATC GGG AAA GCA GGG GAT ACA 1062 
Lys Asp Trp Ser Ser Phe Gly He Asn He Gly Lys Ala Gly Asp Thr 

90 95 100 

ATC GGA ATA TTT GAC CTT GTA TCC TTG AAA GCC CTG GAC GGC GTA CTT mo 
He Gly He Phe Asp Leu Val Ser Leu Lys Ala Leu Asp Gly Val Leu 

105 no ' us 

CCA GAT GGA GTA TCG GAT GCT TCC AGA ACC AGC GCA GAT GAC AAA TGG 1158 
Pro Asp Gly Val Ser Asp Ala Ser Arg Thr Ser Ala Asp Asp Lys Trp 



- 73 - 



BNSDOCID:<WO 9634625A1 IA> 



W0 96/34625 PCT/US96/06053 



120 



125 130 



TTG CCT TTG TAT CTA CTT GGC TTA TAC AGA GTG GGC AGA ACA CAR ATG 
Leu Pro Leu Tyr Leu Leu Gly Leu Tyr Arg Val Gly A^g tS nit 

140 - - - 



145 



CCT GAA TAC AGA AAA AAG CTC ATG GAT GGG CTG ACA AAT CAA TGC AAA 
Pro Glu Tyr Arg Lys Lys Leu Met Asp Gly Leu Thr tel Gin cys J£ 

160 165 

ATG ATC AAT GAA CAG TTT GAA CCT CTT GTG CCA GAA GGT CGT GAC ATT 
Met lie Asn Glu Gin Phe Glu Pro Leu Val Pro Glu Gly Arg Asp lie 

170 1" 180 

TTT GAT GTG TGG GGA AAT GAC AGT AAT TAC ACA AAA ATT GTC GCT GCA 
Phe Asp val Trp Gly Asn Asp Ser Asn Tyr Thr Zy7 lie Val a£ Ma 

155 190 



1S5 



GTG GAC ATG TTC TTC CAC ATG TTC AAA AAA CAT GAA TGT GCC TCG TTP 
Val ASP Met. Phe Phe His Met Phe Lys Lys His G^ Cys Alf Ser Phe 



210 



IS 5r" Glv Thr tTI v!T I CC ^ T 3 C *** GAT TGT GCT GC * TTG GCA 
Arg Tyr Gly Thr lie Val Ser Arg Phe Lys Asp Cys Ala Ala Leu Ala 

220 225 

ACA TTT GGA CAC CTC TGC AAA ATA ACC GGA ATG TCT ACA GAA GAT GTA 
Thr Phe Gly His Leu Cys Lys lie Thr Gly Met s" Thr Glu Sp Val 

240 245 

Thr Thr I™ llf T G ^ C CGA GAA GTT GCA GAT GAA ATG GTC CAA ATG 
Thr Thr Trp lie Leu Asn Arg Glu Val Ala Asp Glu Met Val Gin Met 

255 260 

ATG CTT CCA GGC CAA GAA ATT GAC AAG GCC Ml rCA TAC AiG CCT T1 t 
Met Leu Pro Gly Gin Glu lie Asp Lys Ala Asp g° Met p" ™ 

275 

TTG ATC GAC TTT GGA TTG TCT TCT AAG TCT CCA TAT TCT TCC rrr i„ 
Leu lie Asp Phe Gly Leu Ser Ser Lys Ser Pro S Ser Val 

285 — " *■ 



290 



AAC CCT GCC TTC CAC TTC TGG GGG CAA TTG ACA GCT CTT CTG CTC Aft 
Asn Pro Ala Phe His Phe Trp Gly Gin Leu Thr All £u Leu Leu J£ 
■ 3 300 305 

TCC ACC AGA GCA AGG AAT GCC CGA CAG CCT GAT GAC ATT GAG TAT if. 
Ser Thr Arg Ala Arg Asn Ala Arg Gin Pro Asp Asp lie 22 t£ tS 

315 320 325 

121 CTT ACA GCA GGT 770 77(3 TAC TAT GCA GTA GGA TCC TCT 

Ser Leu Thr Thr Ala Gly Leu Leu Tyr Ala Tyr Ala Val Gly Ser IS 

330 335 340 

GCC GAC TTG GCA CAA CAG TTT TGT GTT GGA GAT AAC AAA TA- ACT CPA 
Ala Asp Leu Ala Gin Gin Phe Cys Val Gly Asp A^n Lyt t£ Thl P^ 

350 355 

GAT GAT AGT ACC GGA GGA TTG ACG ACT AAT GCA CCG CCA CAA GGC ArA 
Asp Asp ser Thr Gly Gly Leu Thr Thr Asn Ala Pro Pro Gly A^g 



370 



2j s is a g 2 s 5 s g ss s js s s 

380 

ACT CCT GAT ATG ATG CAG TAT GCG AAA AGA GCA GTC ATG TCA CTG CAA 



1206 



1254 



s 



1302 



1350 



1398 



1446 



14 94 



1542 



1590 



1638 



1686 



1734 



1782 



1830 



1878 



1926 



1974 
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TCC CAA TCA GAA GCA GTA TCA GAT GTT TGG TCT CTC TCA AAG ACA TCP 
Ser Gin Ser Glu Ala Val Ser Asp Val Trp Ser Leu Ser ijs Thr Ser 
190 1SS 200 

Met rf o CC f* 3 ^ k AGT CTT CCT CTC ATA TCC 

Met Thr Phe Gin Pro Lys Lys Ala Ser Leu Gin Pro Leu Thr lie Ser 

2 X 5 



2022 



2078 



2127 



2175 



2223 



2271 
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Thr Pro Asp Met Met Gin Tyr Ala Lys Arg Ala Val Met Ser Leu Gin 
" 90 395 400 405 

GGC CTA AGA GAG AAG ACA ATT GGC AAG TAT GCT AAG TCA GAA TTT GAC 
Gly Leu Arg Glu Lys Thr lie Gly Lys Tyr Ala Lys Ser Glu Phe 

410 415 420 

AAA TGA CCCTATAATT CTCAGATCAC CTATTATATA TTATGCTACA TATGAAAAAA 
Lys 

ACTAACAGAT ATC ATG GAT AAT CTC ACA AAA GTT CGT GAG TAT CTC AAG 

Met Asp Asn Leu Thr Lys Val Arg Glu Tyr Leu Lys 
1 5 10 

TCC TAT TCT CGT CTG GAT CAG GCG GTA GGA GAG ATA GAT GAG ATC GAA 
Ser Tyr Ser Arg Leu Asp Gin Ala Val Gly Glu lie Asp Glu lie Glu 
15 2 ° . 25 

GCA CAA CGA GCT GAA AAG TCC AAT TAT GAG TTG TTC CAA GAG GAT GGA 
Ala Gin Arg Ala Glu Lys Ser Asn Tyr Glu Leu Phe Gin Glu Asp Gly 

3 5 40 

2£ Clf, ^ f** 3 TCT TAT ^ CAG GCA GCA GAT GAT TCT 

Val Glu Glu His Thr Lys Pro Ser Tyr Phe Gin Ala Ala Asp Asp Ser 

**" 50 55 60 

™° A TCT GAA CCA GAA ATT GAA GA C AAT CAA GGT TTG TAT GCA 

Asp, Thr Glu Ser Glu Pro Glu lie Glu. Asp Asn Gin Gly Leu Tyr Ala 

65 70 75 

CCA GAT CCA GAA GCT GAG CAA GTT GAA GGC TTT ATA CAG GGG CCT TTA 
Pro Asp Pro Glu Ala Glu Gin Val Glu Gly Phe lie Gin Gly Pro Leu 

■-«.•£• 85 go 

GAT GAC TAT GCA GAT GAG GAA GTG GAT GTT GTA TTT ACT TCP, rar rar 
Asp Asp Tyr Ala Asp Glu Glu Val Asp Val vll Phe Thr Ser AsJ 
95 100 10 5 

AAA CAG CCT GAG CTT GAA TCT GAC GAG CAT GGA AAG ACC TTA CGG TTG 
Lys Gin Pro Glu Leu Glu Ser Asp Glu His Gly Lys Thr leu 5 ™ 

^ 2 0 

Thr 1^ GAG ^ T P A AGT GGA GAG 08 *** TCC CAG TGG CTT TCG 

Thr Ser Pro Glu Gly Leu Ser Gly Glu Gin Lys Ser Gin Trp Leu Ser 

130 135 140 

Thr tTT ??* S T ? GT ? ^ AGT GCC *** TAC TGG ^ T CTG GCA GAG 

Thr lie Lys Ala Val Val Gin Ser Ala Lys Tyr Trp Asn Leu Ala Glu 

145 150 155 

CVS tS SI pf IF l° G 2?* ^ GGG GTC MT ATG *** GAG CG ^ CAG 
Cys Thr Phe Glu Ala Ser Gly Glu Gly Val lie Met Lys Glu Arg Gin 

160 165 170 

ATA ACT CCG GAT GTA TAT AAG GTC ACT CCA GTG ATG AAC ACA CAT PPG 
He Thr Pro Asp Val Tyr Lys Val Thr Pro Val Met A^n Thr His Pro 
175 180 i 85 



2319 



2367 



2415 



2463 



2511 



2559 



2607 



2655 



2703 



2751 



220 
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TTG GAT GAA TTG TTC TCA TCT AGA GGA GAG TTC ATC TCT GTC GGA GGT 
Leu Asp Glu Leu Phe Ser Ser Arg Gly Glu Phe He Ser Val Gly Gly 

225 o *i n 



230 235 



GAC GGA CGA ATG TCT CAT AAA GAG GCC ATC CTG CTC GGC CTG AGA TAC 
Asp Gly Arg Met Ser His Lys Glu Ala lie Leu Leu Gly Leu Arg Tyr 

240 245 



250 



AAA AAG TTG TAC AAT CAG GCG AGA GTC AAA TAT TCT CTG TAG 
Lys Lys Leu Tyr Asn Gin Ala Arg Val Lys Tyr Ser Leu 
255 260 265 

• ACTATGAAAA AAAGTAACAG ATATCACGAT CTAAGTGTTA TCCCAATCCA TTCATC 

ATG AGT TCC TTA AAG AAG ATT CTC GGT CTG AAG GGG AAA GGT AAG AAA 
Met Ser Ser Leu Lys Lys He Leu Gly Leu Lys Gly Lys Gly Lys Lys 
1 5 10 15 

TCT AAG AAA TTA GGG ATC GCA CCA CCC CCT TAT GAA GAG GAC ACT AGC 
Ser Lys Lys Leu Gly lie Ala Pro Pro Pro Tyr Glu Glu Asp Thr Ser 

20 25 30 

ATG GAG TAT GCT CCG AGC GCT CCA ATT GAC AAA TCC TAT TTT GGA GTT 
Met Glu Tyr Ala Pro Ser Ala Pro lie Asp Lys Ser Tyr Phe Gly Val 
35 40 45 

GAC GAG ATG GAC ACC TAT GAT CCG AAT CAA TTA AGA TAT GAG AAA TTC 
Asp Glu Met Asp Thr Tyr Asp Pro Asn Gin Leu Arg Tyr Glu Lv£ Phe 

55 60 

£° IT SE* , GT ? *** ATG ACG GTT AGA TCT AAT CGT CCG TTC AGA ACA 
Phe Phe Thr Val Lys Met Thr Val Arg Ser Asn Arg Pro pAe Krg tS 

75 80 
TAC TCA GAT GTG GCA GCC GCT GTA TCC CAT TGG GaT CAC ATG TAC ATC 

Tyr Ser Asp Val Ala Ala Ala Val Ser His Trp Asp His Met t£ He 

85 9° 95 

GGA ATG GCA GGG AAA CGT CCC TTC TAC AAA ATC TTG GCT TTT TTG GTT 
Gly Met Ala Gly Lys Arg Pro Phe Tyr Lys He Leu a" Phe Leu G^y 

TCT TCT AAT CTA AAG GCC ACT CCA GCG GTA TTG GCA GAT CAA GGT CA1 
Ser ser Asn Leu Lys Ala Thr Pro Ala Val Leu Sa ^ Gly Gin 

120 125 

CCA GAG TAT CAC ACT CAC TGC GAA GGC AGG GCT TAT TTG CCA CAT Acr 
Pro Glu Tyr. His . Thr His Cys Glu Gly Arg Ala Tyr Leu Pro His 
XJU 135 140 

ATG GGG AAG ACC CCT CCC ATG CTC AAT GTA CCA GAG CAC TTC AGA AGA 
Met Gly Lys Thr Pro Pro Met Leu Asn Val Pro Glu His Phe j£J £g 

155 160 

CCA TTC AAT ATA GGT CTT TAC AAG GGA ACG ATT GAG CTC ACA ATG ACC 
Pro Phe Asn lie Gly Leu Tyr Lys Gly Thr He Glu Leu Thr Met Thr 

165 i 7 o 1?5 

ATC TAC GAT GAT GAG TCA CTG GAA GCA GCT CCT ATG ATC TGG GAT CAT 
He Tyr Asp Asp Glu Ser Leu Glu Ala Ala Pro Met lie Trp Asp His 

£f ^ T T CT TCC *** 7X7 TCT GAT TTC AGA GAG AAG GCC TTA ATG TTT 
Phe Asn Ser Ser Lys Phe Ser Asp Phe Arg Glu Lys Ala Leu Met P^e 

200 205 / 

GGC CTG ATT GTC GAG AAA AAG GCA TCT GGA GCG TGG GTC CTG GAT TCT 
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Gly Leu He Val Glu Lys Lys Ala Ser Gly Ala Trp Val Leu Asp Ser 
210 215 220 

ATC AGC CAC TTC AAA TGA GCTAGTCTAA CTTCTAGCTT CTGAACAATC ifi^ 
He Ser His Phe Lys '*'" 
22 5 

CCCGGTTTAC TCAGTCTCTC CTAATTCCAG CCTCTCGAAC AACTAATATC CTGTCTTTTC 3 725 

TATCCCTATG AAAAAAACTA ACAGAGATCG ATCTGTTTAC GCGTCACT ATG AAG' TGC 3782 

Met Lys Cys 
1 

CTT TTG TAC TTA GCC TTT TTA TTC ATT GGG GTG AAT TGC AAG TTC ACC 3 830 

Leu Leu Tyr Leu Ala Phe Leu Phe He Gly Val Asn Cys Lys Phe Thr 
5 10 15 

ATA GTT TTT CCA CAC AAC CAA AAA GGA AAC TGG AAA AAT GTT CCT TCT 
He Val Phe Pro His Asn Gin Lys Gly Asn Trp Lys Asn Val Pro Ser 
20 25 30 . 35 

AAT TAC CAT TAT TGC CCG TCA AGC TCA GAT TTA AAT TGG CAT AAT GAC 3926 
Asn Tyr His Tyr Cys Pro Ser Ser Ser Asp Leu Asn Trp His Asn Asp 

40 45 50 

TTA ATA GGC ACA GCC ATA CAA GTC AAA ATG CCC AAG AGT CAC AAG GCT 3 974 

Leu He Gly Thr Ala He Gin Val Lys Met Pro Lys Ser His Lys Ala 

55 60 65 

ATT CAA GCA GAC GGT TGG ATG TGT CAT GCT TCC AAA TGG GTC ACT ACT 4 022 

He Gin Ala Asp Gly Trp Met Cys His Ala Ser Lys Trp Val Thr Thr 
7 ° 75 80 

TGT GAT TVC '-CGC^TGo TAT GGA CCG AAG TAT ATA ACA CAG TCC ATC CGA 
Cys Asp Phe Arcf lTp Tyr Gly Pro Lys Tyr He Thr Gin Ser He Arq 
85 90 95 3 

TCC TTC ACT CCA TCT GTA GAA CAA TGC AAG GAA AGC ATT GAA CAA ACG 
Ser Phe Thr Pro Ser Val Glu Gin Cys Lys Glu Ser He Glu Gin Thr 
100 105 no 115 

AAA CAA GGA ACT TGG CTG AAT CCA GGC TTC CCT CCT CAA AGT TGT GGA 4166 
Lys Gin Gly Thr Trp Leu Asn Pro Gly Phe Pro Pro Gin Ser Cys Gly 

120 125 130 

TAT GCA ACT GTG ACG GAT GCC GAA GCA GTG ATT GTC CAG GTG ACT CCT 
Tyr Ala Thr Val Thr Asp Ala Glu Ala Val He Val Gin Val Thr Pro 

135 140 145 

CAC CAT GTG CTG GTT GAT GAA TAC ACA GGA GAA TGG GTT GAT TCA CAG* * 
His His Val Leu Val Asp Glu Tyr Thr Gly Glu Trp Val Asp Ser Gin 
150 155 160 ' 

TTC ATC AAC GGA AAA TGC AGC AAT TAC ATA TGC CCC ACT GTC CAT AAC 
Phe He Asn Gly Lys Cys Ser Asn Tyr He Cys Pro Thr Val His Asn 
165 170 " 175 



4070 



4118 



4214 



4262 



4310 



4406 



TCT ACA ACC TGG CAT TCT GAC TAT AAG GTC AAA GGG CTA TGT GAT TCT 43 5B 

Ser Thr Thr Trp His Ser Asp Tyr Lys Val Lys Gly Leu Cys Asp Ser " 
160 185 190 * 195 

AAC CTC ATT TCC ATG GAC ATC ACC TTC TTC TCA GAG GAC GGA GAG CTA 
Asn- Leu He Ser Met Asp He Thr Phe Phe Ser Glu Asp Gly Glu Leu 

200 205 " 210 

TCA TCC CTG GGA AAG GAG GGC ACA GGG TTC AGA AGT AAC TAC TTT GCT 
Ser Ser Leu Gly Lys Glu. Gly Thr Gly Phe Arg Ser Asn Tyr Phe Ala 
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215 220 225 

TAT GAA ACT GGA GGC AAG GCC TGC AAA ATG CAA TAC TGC AAG CAT TGG 4502 
Tyr Glu Thr Gly Gly Lys Ala Cys Lys Met Gin Tyr Cys Lys His Trp 
230 235 " 240 

GGA GTC AGA CTC CCA TCA GGT GTC TGG TTC GAG ATG GCT GAT AAG GAT 4550 
Gly Val Arg Leu Pro Ser Gly Val Trp Phe Glu Met Ala Asp Lys Asp 
245. 250 255 

CTC TTT GCT GCA GCC AGA TTC CCT GAA TGC CCA GAA GGG TCA AGT ATC 4598 
Leu Phe Ala Ala Ala Arg Phe Pro Glu Cys Pro Glu Gly Ser Ser lie 
260 265 270 275 

TCT GCT CCA TCT CAG ACC TCA GTG GAT.GTA AGT CTA ATT CAG GAC GTT 4646 
Ser Ala Pro Ser Gin Thr Ser Val Asp Val Ser Leu He Gin Asp Val 

280 285 290 

GAG AGG ATC TTG GAT TAT TCC CTC TGC CAA GAA ACC TGG AGC AAA ATC 4694 
Glu Arg He Leu Asp Tyr Ser Leu Cys Gin Glu Thr Trp Ser Lys He 

295 300 305 

AGA GCG GGT CTT CCA ATC TCT CCA GTG GAT CTC AGC TAT CTT GCT CCT 4 742 

Arg Ala Gly Leu Pro He Ser Pro Val Asp Leu Ser Tyr Leu Ala Pro 
310 315 320 

AAA AAC CCA GGA ACC GGT CCT GCT TTC ACC ATA ATC AAT GGT ACC CTA 4790 
Lys Asn Pro Gly Thr Gly Pro Ala Phe Thr He He Asn Gly Thr Leu 
325 .330 335 

AAA TAC TTT GAG ACC AGA TAC ATC AGA GTC GAT ATT GCT GCT CCA ATC 4838 
, Lys Tyr Phe Glu Thr Arg Tyr He Arg Val Asp He Ala Ala Pro He 
340 345 350 355 

CTC TCA AGA ATG GTC GGA ATG ATC AGT GGA ACT ACC ACA GAA AGG GAA 4 886 

Leu Ser Arg Met Val Gly Met He Ser Gly Thr Thr Thr Glu Arg Glu 

360 365 370 

CTG TGG GAT GAC TGG GCA CCA TAT GAA GAC GTG GAA ATT GGA CCC AAT 4934 
Leu Trp Asp Asp Trp Ala Pro Tyr Glu Asp Val Glu He Gly Pro Asn 

375 380 385 

GGA GTT CTG AGG ACC AGT TCA GGA TAT AAG TTT CCT TTA TAC ATG ATT 4 982 

Gly Val Leu Arg Thr Ser Ser Gly Tyr Lys Phe Pro Leu Tyr Met lie 
390 395 400 

GGA CAT GGT ATG TTG GAC TCC GAT CTT CAT CTT AGC TCA AAG GCT CAG 5030 
Gly His Gly v Met Leu Asp Ser Asp Leu His Leu Ser Ser Lys Ala Gin 
405 410 415 

GTG TTC GAA CAT CCT CAC ATT CAA GAC GCT GCT TCG CAA CTT CCT GAT 5078 
Val Phe Glu His Pro His He Gin Asp Ala Ala Ser Gin Leu Pro Asp 
420 425 430 435 

GAT GAG AGT TTA TTT TTT GGT GAT ACT GGG CTA TCC AAA AAT CCA ATC 5126 
Asp Glu Ser Leu Phe Phe Gly Asp Thr Gly Leu Ser Lys Asn Pro He 

440 445 " 450 

GAG CTT GTA GAA GGT TGG TTC AGT AGT TGG AAA AGC TCT ATT GCC TCT 5174 
Glu Leu Val Glu Gly Trp Phe Ser Ser Trp Lys Ser Ser He Ala Ser 

455 460 465 

TTT TTC TTT ATC ATA GGG TTA ATC ATT GGA CTA TTC TTG GTT CTC CGA 5222 
Phe Phe Phe He He Gly Leu ' He He Gly Leu Phe Leu Val Leu Arg 
470 475 480 

GTT GGT ATC CAT CTT TGC ATT" AAA TTA AAG CAC ACC AAG AAA AGA CAG 52 70 
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Val Gly He His Leu Cys lie Lys Leu Lys His Thr Lys Lys Arg Gin 
485 490 ^ 495 

ATT TAT ACA GAC ATA GAG ATG AAC CGA CTT GGA AAG TAA CTCAAATCCT 5319 
He Tyr Thr Asp He Glu Met Asn Arg Leu Gly Lys 
500 505 ~ 510 - 

G CTAGCCAG A TTCTTCATGT TTGGACCAAA TCAACTTGTG ATACCATGCT CAAAGAGGCC 5379 

TCAATTATAT TTGAGTTTTT AATTTTTATG AAAAAAACTA ACAGCAATC ATG GAA 5434 

Met Glu 
1 

GTC CAC GAT TTT GAG ACC GAC GAG TTC AAT GAT TTC AAT GAA GAT GAC 54 82 

Val His Asp Phe Glu Thr Asp Glu Phe Asn Asp Phe Asn Glu Asp Asp 
5 10 15 

TAT GCC ACA AGA GAA TTC CTG AAT CCC GAT GAG CGC ATG ACG TAC TTG 553 0 

Tyr Ala Thr Arg Glu Phe Leu Asn Pro Asp Glu Arg Met Thr Tyr Leu 
20 25 30 

AAT CAT GCT GAT TAC AAT TTG AAT TCT CCT CTA ATT AGT GAT GAT ATT 5 57 8 

Asn His Ala Asp Tyr Asn Leu Asn Ser Pro Leu He Ser Asp Asp He 
35 40 45 " 50 

GAC AAT TTG ATC AGG AAA TTC AAT TCT CTT CCG ATT CCC TCG ATG TGG 5626 
Asp Asn Leu He Arg Lys Phe Asn Ser Leu Pro He Pro Ser Met Trp 

55 60 65 

GAT AGT AAG AAC TGG GAT GGA GTT CTT ' GAG ATG TTA ACA TCA TGT CAA 5674 
Asp Ser Lys Asn Trp Asp Gly Val Leu Glu Met Leu Thr Ser Cys Gin 

70 75 80 

GCC AAT CCC AT- TCA ACA TCT CAG ATG CAT AAA TGG ATG GGA AGT TGG 5722 
Ala Asn Pro He Ser xiir Ser Gin Met His Lys Trp Met Gly Ser Trp 
85 90 " 95 

TTA ATG TCT GAT AAT CAT GAT GCC AGT CAA GGG TAT AGT TTT TTA CAT ' 5770 

Leu Met Ser Asp Asn His Asp Ala Ser Gin Gly Tyr Ser Phe Leu His 
100 105 no 

GAA GTG GAC AAA GAG GCA GAA ATA ACA TTT GAC GTG GTG GAG ACC TTC 5818 
Glu Val Asp Lys Glu Ala Glu He Thr Phe Asp Val Val Glu Thr Phe 
115 120 125 130 

ATC CGC GGC TGG GGC AAC AAA CCA ATT GAA TAC ATC AAA AAG GAA AGA 5866 
He Arg Gly Trp Gly Asn Lys Pro lie Glu Tyr He Lys Lys Glu Arg 

135 140 " 145 

TGG ACT GAC TCA TTC AAA ATT CTC GCT TAT TTG TGT CAA AAG TTT TTG 5914 
Trp Thr Asp Ser Phe Lys lie Leu Ala Tyr Leu Cys Gin Lys Phe Leu 

150 155 * 160 

GAC TTA CAC AAG TTG ACA TTA ATC TTA AAT GCT GTC TCT GAG GTG GAA 5962 
Asp Leu His Lys Leu Thr Leu lie Leu Asn Ala Val Ser Glu Val Glu 
165 170 175 

TTG CTC AAC TTG GCG AGG ACT TTC AAA GGC AAA GTC AGA AGA AGT TCT 6010 
Leu Leu Asn Leu Ala Arg Thr Phe Lys Gly Lys Val Arg Arg Ser Ser 
180 185 190 

CAT GGA ACG AAC ATA TGC AGG ATT AGG GTT CCC AGC TTG GGT CCT ACT 6 058 

His Gly Thr Asn lie Cys Arg lie Arg Val Pro Ser Leu Gly Pro Thr 

200 205 - 210 

TTT ATT TCA GAA GGA TGG GCT TAC TTC AAG AAA CTT GAT ATT CTA ATG 6106 
Phe He Ser Glu Gly Trp Ala Tyr Phe Lys Lys Leu Asp He Leu Met 



- 79 - 



BNSDOCID: <WO 



9634€25A1_IA> 



WO 96/34625 PCT/US96/06053 



215 220 225 

GAC CGA AAC TTT CTG TTA ATG GTC AAA GAT GTG ATT ATA GGG AGG ATG 6154 
Asp Arg Asn Phe Leu Leu Met Val Lys Asp Val He He Gly Arg Met ■ 

230 235 240 

CAA ACG GTG CTA TCC ATG GTA TGT AGA ATA GAC AAC CTG TTC TCA GAG 62 02 

Gin Thr Val Leu Ser Met Val Cys Arg He Asp Asn Leu Phe Ser Glu 
245 250 255 

CAA GAC ATC TTC TCC CTT CTA AAT ATC TAC AGA ATT GGA GAT AAA ATT 6250 
Gin Asp He Phe Ser Leu Leu Asn He Tyr Arg He Gly Asp Lys He 
260 265 270 

GTG GAG AGG CAG GGA AAT TTT TCT TAT GAC TTG ATT AAA ATG GTG GAA 6298 
Val Glu Arg Gin Gly Asn Phe Ser Tyr Asp Leu He Lys Met Val Glu 
275 2B0 285 290 

CCG ATA TGC AAC TTG AAG CTG ATG AAA TTA GCA AGA GAA TCA AGG CCT 6346 
Pro He Cys Asn Leu Lys Leu Met Lys Leu Ala Arg Glu Ser Arg Pro 

295 ' 300 ~ 305 

TTA GTC CCA CAA TTC CCT CAT TTT GAA AAT CAT ATC AAG ACT TCT GTT 6394 
Leu Val Pro Gin Phe Pro His Phe Glu Asn His He Lys Thr Ser Val 

310 315 " 320 

GAT GAA GGG GCA AAA ATT GAC CGA GGT ATA AGA TTC CTC CAT GAT CAG 6442 
Asp Glu Gly Ala Lys He Asp Arg Gly He Arg Phe Leu His Asp Gin 
325 330 335 

ATA ATG AGT GTG AAA ACA GTG GAT CTC ACA CTG GTG ATT TAT GGA TCG 6490 
.He Met Ser Val Lys Thr Val Asp Leu Thr Leu Val He Tyr Gly Ser 
340 ' 345 350 

TTC AGA CAT TGG GGT CAT CCT TTT ATA GAT TAT TAC ACT GGA CTA GAA 6538 
Phe Arg His Trp Gly His Pro Phe He Asp Tyr Tyr Thr Gly Leu Glu 
355 360 365 - 370 

AAA TTA CAT TCC CAA GTA ACC ATG AAG AAA GAT ATT GAT GTG TCA TAT -6586 
Lys Leu His Ser Gin Val Thr Met Lys Lys Asp He Asp Val Ser Tyr 

375 380 * 385 

GCA AAA GCA CTT GCA AGT GAT TTA GCT CGG ATT GTT CTA TTT CAA CAG 6634 
Ala Lys Ala Leu Ala Ser Asp Leu Ala Arg He Val Leu Phe Gin Gin 

390 395 ~ 400 

TTC AAT GAT CAT AAA AAG TGG TTC GTG AAT GGA GAC TTG CTC CCT CAT 6682 
Phe Asn Asp His Lys Lys Trp Phe Val Asn Gly Asp Leu Leu Pro His 
405 410 415 

GAT CAT CCC TTT AAA AGT CAT GTT AAA GAA AAT ACA TGG CCC ACA GCT 6730 
Asp His Pro Phe Lys Ser His Val Lys Glu Asn Thr Trp Pro Thr Ala 
420 425 430 

GCT CAA GTT CAA GAT TTT GGA GAT AAA TGG CAT GAA CTT CCG CTG ATT 6778 
Ala Gin Val Gin Asp Phe Gly Asp Lys Trp His Glu Leu Pro Leu He 
435 440 445 450 

AAA TGT TTT GAA ATA CCC GAC TTA CTA GAC CCA TCG ATA ATA TAC TCT 6826 
Lys Cys Phe Glu He Pro Asp Leu Leu Asp Pro Ser He He Tyr Ser 

455 460 465 

GAC AAA AGT CAT TCA ATG AAT AGG TCA GAG GTG TTG AAA CAT GTC CGA 6 874 

Asp Lys Ser His Ser Met Asn Arg Ser Glu Val Leu Lys His Val Arg 

470 475 480 

ATG AAT CCG AAC ACT CCT ATC CCT AGT AAA AAG GTG TTG CAG ACT ATG 6 922 
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Met Asn Pro Asn Thr Pro He Pro Ser Lys Lys Val Leu Gin Thr Met 
4 85 4 90 - - 

TTG GAC ACA AAG GCT ACC AAT- TGG AAA GAA TTT CTT AAA GAG ATT GAT 6 97 0 

Leu Asp Thr Lys Ala Thr Asn Trp Lys Glu Phe Leu Lys Glu He Asp 
500 505 510 

GAG AAG GGC TTA GAT GAT GAT GAT CTA ATT ATT GGT CTT AAA GGA AAG 7 0 IB 

Glu Lys Gly Leu Asp Asp Asp Asp Leu He He Gly Leu Lys Gly Lys 
515 520 525 530 

GAG AGG GAA CTG AAG TTG GCA GGT AGA TTT TTC TCC CTA ATG TCT TGG 7066 
Glu Arg Glu Leu Lys Leu Ala Gly Arg Phe Phe Ser Leu Met Ser Trp 

535 " 540 545 

AAA TTG CGA GAA TAC TTT GTA ATT ACC GAA TAT TTG ATA AAG ACT CAT 7114 
Lys Leu Arg Glu Tyr Phe Val lie Thr Glu Tyr Leu He Lys Thr His 

550 555 " 560 

TTC GTC CCT ATG TTT AAA GGC CTG ACA ATG GCG GAC GAT CTA ACT GCA 7162 
Phe Val Pro Met Phe Lys Gly Leu Thr Met Ala Asp Asp Leu Thr Ala 
565 570 575 

GTC ATT AAA AAG ATG TTA GAT TCC TCA TCC GGC CAA GGA' TTG AAG TCA 7210 
Val He Lys Lys Met Leu Asp Ser Ser Ser Gly Gin Gly Leu Lys Ser 
580 585 590 

TAT GAG GCA ATT TGC ATA GCC AAT CAC ATT GAT TAC GAA AAA TGG AAT .72 58 

Tyr Glu Ala He Cys lie Ala Asn His He Asp Tyr Glu Lys Trp Asn 
595 600 605 610 

AAC CAC CAA AGG AAG TTA TCA AAC GGC CCA GTG TTC CGA GTT ATG GGC 7306 
Asn His Gin Arg Lys Leu Ser Asn Gly Pro Val Phe Arg Val Met Gly 

6!i-i 'I:- .620 625 

CAG TTC TTA GGT TAT CCA TCC TTA ATC GAG AGA ACT CAT GAA TTT TTT 7354 
Gin Phe Leu Gly Tyr Pro Ser Leu He Glu Arg Thr His Glu Phe Phe 

630 635 ~ 640 

GAG AAA AGT CTT ATA TAC TAC AAT GGA AGA CCA GAC TTG ATG CGT GTT- 74 02 

Glu Lys Ser Leu He Tyr Tyr Asn Gly Arg Pro Asp Leu Met Arg Val 
645 650 " ' 655 

CAC AAC AAC ACA CTG ATC AAT TCA ACC TCC CAA CGA GTT TGT TGG CAA 74 50 

His Asn Asn Thr Leu He Asn Ser Thr Ser Gin Arg Val Cys Trp Gin 
660 665 670 

GGA CAA GAG GGT GGA CTG GAA GGT CTA CGG CAA AAA GGA TGG ACT ATC 74 98 

Gly Gin Glu Gly Gly Leu Glu Gly Leu Arg Gin Lys Gly Trp Thr He 
675 680 685 " ' 690 

CTC AAT CTA CTG GTT ATT CAA AGA GAG GCT AAA ATC AGA AAC ACT GCT 7546 
Leu Asn Leu Leu Val He Gin Arg Glu Ala Lys He Arg Asn Thr Ala 

695 700 ~ 705 

GTC AAA GTC TTG GCA CAA GGT GAT AAT CAA GTT ATT TGC ACA CAG TAT 7594 
Val Lys Val Leu Ala Gin Gly Asp Asn Gin Val He Cys Thr Gin Tyr 

710 715 ' 720 

AAA ACG AAG AAA TCG AGA AAC GTT GTA GAA TTA CAG GGT GCT CTC AAT 7642 
Lys Thr Lys Lys Ser Arg Asn Val Val Glu Leu Gin Gly Ala Leu Asn 
725 730 735 

CAA ATG GTT TCT AAT AAT GAG AAA ATT ATG ACT GCA ATC AAA ATA GGG 7690 
Gin Met Val Ser Asn Asn Glu Lys He Met Thr Ala He Lys He Gly 
740 745 750 
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ACA GGG AAG TTA GGA CTT TTG ATA AAT GAC GAT GAG ACT ATG CAA TCT 77 38 

Thr Gly Lys Leu Gly Leu Leu He Asn Asp Asp Glu Thr Met Gin Ser 
755 . 760 765 770 

GCA GAT TAC TTG AAT TAT GGA AAA ATA CCG ATT TTC CGT GGA GTG ATT 7786 
Ala Asp Tyr Leu Asn Tyr Gly Lys He Pro He Phe Arg Gly Val He 

775 780 785 

AGA GGG TTA GAG ACC AAG AGA TGG TCA CGA GTG ACT TGT GTC ACC AAT 7834 
Arg Gly Leu. Glu Thr Lys Arg Trp Ser Arg Val Thr Cys Val Thr Asn 

790 795 " 800 

GAC CAA ATA CCC ACT TGT GCT AAT ATA ATG AGC TCA GTT TCC ACA AAT 7882 
Asp Gin lie .Pro Thr Cys Ala Asn lie Met Ser Ser Val Ser Thr Asn 
805 810 815 

GCT CTC ACC GTA GCT CAT TTT GCT GAG AAC CCA ATC AAT GCC ATG ATA 7930 
Ala Leu Thr Val Ala His Phe Ala Glu Asn Pro He Asn Ala Met He 
820 825 830 

CAG TAC AAT TAT TTT GGG ACA TTT GCT AGA CTC TTG TTG ATG ATG CAT 7978 
Gin Tyr Asn Tyr Phe Gly Thr Phe Ala Arg Leu Leu Leu Met Met His 
835 840 845 850 

GAT CCT GCT CTT CGT CAA TCA TTG TAT GAA GTT CAA GAT AAG ATA CCG 8026 
Asp Pro Ala Leu Arg Gin Ser Leu Tyr Glu Val Gin Asp Lys He Pro 

855 860 . 865 

GGC TTG CAC AGT TCT ACT TTC AAA TAC GCC ATG TTG TAT TTG GAC CCT 8074 
Gly Leu His Ser Ser Thr Phe Lys Tyr Ala Met Leu Tyr Leu Asp Pro 

870 .875 ' 880 

TCC ATT GGA GGA GTG TCG GGC ATG TCT TTG TCC AGG TTT TTG ATT AGA 8122 
Ser He Gly Gly Val Ser Gly Met Ser Lev; Ser Arg Phe Leu. He' iu. 7 
885 890 895 

GCC TTC CCA GAT CCC GTA ACA GAA AGT CTC TCA TTC TGG AGA TTC ATC 8170 
Ala Phe Pro Asp Pro Val Thr Glu Ser Leu Ser Phe Trp Arg Phe lie « 
900 905 910 

CAT GTA CAT GCT CGA AGT GAG CAT CTG AAG GAG ATG AGT GCA GTA TTT 8218 
His Val His Ala Arg Ser Glu His Leu Lys Glu Met Ser Ala Val Phe 
915 920 925 930 

GGA AAC CCC GAG ATA GCC AAG TTT CGA ATA ACT CAC ATA GAC AAG CTA 8266 
Gly Asn Pro Glu He Ala Lys Phe Arg lie Thr His lie Asp Lys Leu 

935 940 945 

GTA GAA GAT CCA ACC TCT CTG AAC ATC GCT ATG GGA ATG AGT CCA GCG 8314 
val Glu Asp Pro Thr Ser Leu Asn lie Ala Met Gly Met Ser Pro Ala 

950 955 * 960 

AAC TTG TTA AAG ACT GAG GTT AAA AAA TGC TTA ATC GAA TCA AGA CAA 8362 
Asn Leu Leu Lys Thr Glu Val Lys Lys Cys Leu lie Glu Ser Arg Gin " " 
965 970 975 

ACC ATC AGG AAC CAG GTG ATT AAG GAT GCA ACC ATA TAT TTG TAT CAT 8410 
Thr lie Arg Asn Gin Val lie Lys Asp Ala Thr He Tyr Leu Tyr His 
980 985 990 

GAA GAG GAT CGG CTC AGA. AGT TTC TTA TGG TCA ATA AAT CCT CTG TTC 84 58 

Glu Glu Asp Arg Leu Arg Ser Phe Leu Trp Ser lie Asn Pro Leu Phe 
995 1000 1005 1010 

CCT AGA TTT TTA AGT GAA TTC AAA TCA GGC ACT TTT TTG GGA GTC GCA 8506 
Pro Arg Phe Leu Ser Glu Phe Lys Ser Gly Thr Phe Leu Gly Val Ala 

10 15 1020 1025 
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GAC GGG CTC ATC AGT CTA TTT CAA AAT TCT CGT ACT ATT CGG AAC TCC 85 54 

Asp Gly Leu lie Ser Leu Phe Gin Asn Ser Arg Thr lie Arg Asn Ser 

1030 1035 1040 

TTT AAG AAA AAG TAT CAT AGG GAA TTG GAT GAT TTG ATT GTG AGG AGT 86 02 

Phe Lys Lys Lys Tyr His Arg Glu Leu Asp Asp Leu lie Val Arg Ser 
1045 1050 * 1055 

GAG GTA TCC TCT TTG ACA CAT TTA GGG AAA CTT CAT TTG AGA AGG GGA 8650 
Glu Val Ser Ser Leu Thr His Leu Gly Lys Leu His Leu Arg Arg Gly 
1060 1065 * 1070 

TCA TGT AAA ATG TGG ACA TGT TCA GCT ACT CAT GCT GAC ACA TTA AGA 86 98 

Ser Cys Lys Met Trp Thr Cys Ser Ala Thr His Ala Asp Thr Leu Arg 
1075 1080 1085 1090 

TAC AAA TCC TGG GGC CGT ACA GTT ATT GGG ACA ACT GTA CCC CAT CCA 8 746 

Tyr Lys Ser Trp Gly Arg Thr Val He Gly Thr Thr Val Pro His Pro 

1095 1100 ' 1105 

TTA GAA ATG TTG GGT CCA CAA CAT CGA AAA GAG ACT CCT TGT GCA CCA 8794 
Leu Glu Met Leu Gly Pro Gin His Arg Lys Glu Thr Pro Cys Ala Pro 

1110 ins 1120 

TGT AAC ACA TCA GGG TTC AAT TAT GTT TCT GTG CAT TGT CCA GAC GGG 8 842 

Cys Asn Thr Ser Gly Phe Asn Tyr Val Ser Val His Cys Pro Asp Gly 
1125 1130 H35 

ATC CAT GAC GTC TTT AGT TCA CGG GGA CCA TTG CCT GCT TAT CTA GGG 88 90 

He His Asp Val Phe Ser Ser Arg Gly Pro Leu Pro Ala Tyr Leu Gly 
1140 1145 1150. 

TCT AAA ACA TCT GAA TCT ACA TCT ATT TTG CAG CCT TGG GAA AGG GAA 8938 
Ser Lys Thr Ser Glu S^r .Thr Ser lie Leu Gin Pro Trp Glu Arg Glu 
H55 1160 1165 1170 

AGC AAA GTC CCA CTG ATT AAA AGA GCT ACA CGT CTT AGA GAT GCT ATC 8986 
Ser Lys Val Pro Leu He Lys Arg Ala Thr Arg Leu Arg Asp Ala He 

1175 1180 ~ ^ H85 

t 

TCT TGG TTT GTT GAA CCC GAC TCT AAA CTA GCA ATG ACT ATA CTT TCT 9034 
Ser Trp Phe Val Glu Pro Asp Ser Lys Leu Ala Met Thr He Leu Ser 

1190 1195 1200 

AAC ATC CAC TCT TTA ACA GGC GAA GAA TGG ACC AAA AGG CAG CAT GGG 9082 
Asn He His Ser Leu Thr Gly Glu Glu Trp Thr Lys Arg Gin His Gly 
1205 1210 ~ 1215 

TTC AAA AGA ACA GGG TCT GCC CTT CAT AGG TTT TCG ACA TCT CGG ATG 9130 
Phe Lys Arg Thr Gly Ser Ala Leu His Arg Phe Ser Thr Ser Arg Met 
1220 1225 1230 

AGC CAT GGT GGG TTC GCA TCT CAG AGC ACT GCA GCA TTG ACC AGG TTG 9178 
Ser His Gly Gly Phe Ala Ser Gin Ser Thr Ala Ala Leu Thr Arg Leu 
1235 1240 1245 1250 

ATG GCA ACT ACA GAC ACC ATG AGG GAT CTG GGA GAT CAG AAT TTC GAC 9226 
Met Ala Thr Thr Asp Thr Met Arg Asp Leu Gly Asp Gin Asn Phe Asp 

1255 1260 1265 

TTT TTA TTC CAA GCA ACG TTG CTC TAT GCT CAA ATT ACC ACC ACT GTT 92 74 

Phe Leu Phe Gin Ala Thr Leu Leu Tyr Ala Gin He Thr Thr Thr Val 

1270 . 1275 1280 

GCA AGA GAC GGA TGG ATC ACC AGT TGT ACA GAT CAT TAT CAT ATT GCC 9322 
Ala Arg Asp Gly Trp He Thr Ser Cys Thr Asp His Tyr His He Ala . 
1285 1290 1295 
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TGT AAG TCC TGT TTG AGA CCC ATA GAA GAG ATC ACC CTG GAC TCA AGT 93 7 0 

Cys Lys Ser Cys Leu Arg Pro lie Glu Glu lie Thr Leu Asp Ser Ser 
1300 1305 13 io 

ATG GAC TAC ACG CCC CCA GAT GTA TCC CAT GTG CTG AAG ACA TGG AGG 9418 
Met Asp Tyr Thr Pro Pro Asp Val Ser His Val Leu Lys Thr Trp Arq 
1315 1320 1325 i 3 5o 

AAT GGG GAA GGT TCG TGG GGA CAA GAG ATA AAA CAG ATC TAT CCT TTA 9466 
Asn Gly Glu Gly Ser Trp Gly Gin Glu He Lys Gin He Tyr Pro Leu 

1335 1340 1345 

GAA GGG AAT TGG AAG AAT TTA GCA CCT GCT GAG CAA TCC TAT CAA GTC 9514 
. Glu Gly Asn Trp Lys Asn Leu Ala Pro Ala Glu Gin Ser Tyr Gin Val 

1350 1355 1360 

GGC AGA TGT ATA GGT TTT CTA TAT GGA GAC TTG GCG TAT AGA AAA TCT 9562 
Gly Arg Cys He Gly Phe Leu Tyr Gly Asp Leu Ala Tyr Arg Lys Ser 
1365 1370 1375 

ACT CAT GCC GAG GAC AGT TCT CTA TTT CCT CTA TCT ATA CAA GGT CGT 9610 
Thr His Ala Glu Asp Ser Ser Leu Phe Pro Leu Ser He Gin Gly Arq 
1380 1385 1390 

ATT AGA GGT CGA GGT TTC TTA AAA GGG TTG CTA GAC GGA TTA ATG AGA 9658 
He Arg Gly Arg Gly Phe Leu Lys Gly Leu Leu Asp Gly Leu Met Arq 
1395 1400 1405 1410 

GCA AGT TGC TGC CAA GTA ATA CAC CGG AGA AGT CTG GCT CAT TTG AAG 9706 
Ala Ser Cys Cys Gin Val He His Arg Arg Ser Leu Ala His Leu Lys 

1415 1420 1425 

AGG CCG GCC AAC GCA GTG TAC GGA GGT TTG ATT TAC TTG ATT GAT AAA 9754 
Arg Pro Ala Asn Ala Val Tyr Gly Gly Leu He Tyr Leu He Asp Z'/b ' 

1430 1435 1440 * 

TTG AGT GTA TCA CCT CCA TTC CTT TCT CTT ACT AGA TCA GGA CCT ATT 9802 
Leu Ser Val Ser Pro Pro Phe Leu Ser Leu Thr Arg Ser Gly Pro He " 
1445 1450 ^1455 

AGA GAC GAA TTA GAA ACG ATT CCC CAC AAG ATC CCA ACC TCC TAT CCG 985 0 

Arg Asp Glu Leu Glu Thr He Pro His Lys He Pro Thr Ser Tyr Pro 
1460 1465 1470 

ACA AGC AAC CGT GAT ATG GGG GTG ATT GTC AGA AAT TAC TTC AAA TAC 98 98 

Thr Ser Asn Arg Asp Met Gly Val lie Val Arg Asn Tyr Phe Lys Tvr 
1475 1480 1485 1490 

CAA TGC CGT CTA ATT GAA AAG GGA AAA TAC AGA TCA CAT TAT TCA CAA 994 6 

Gin Cys Arg Leu He Glu Lys Gly Lys Tyr Arg Ser His Tyr Ser Gin 

1495 1500 * 1505 

TTA TGG TTA TTC TCA GAT GTC TTA TCC ATA GAC TTC ATT GGA CCA TTC 9994 
Leu Trp Leu Phe Ser Asp Val Leu Ser He Asp Phe He Gly Pro Phe 

1510 1515 " 1520 

TCT ATT TCC ACC ACC CTC TTG CAA ATC CTA TAC AAG CCA TTT TTA TCT 10042 
Ser He Ser Thr Thr Leu Leu Gin He Leu Tyr Lys Pro Phe Leu Ser 
1525- 1530 1535 

GGG AAA GAT AAG AAT GAG TTG AGA GAG CTG GCA AAT CTT TCT TCA TTG 10090 
Gly Lys Asp Lys Asn Glu Leu Arg Glu Leu Ala Asn Leu Ser Ser Leu 
1540 1545 1550 

CTA AGA TCA GGA GAG GGG TGG GAA GAC ATA CAT GTG AAA TTC TTC ACC 1013 8 

Leu Arg Ser Gly Glu Gly Trp Glu Asp He His Val Lys Phe Phe Thr 
1555 1560 i5 6 5 1570 
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AAG GAC ATA TTA TTG TGT CCA GAG GAA ATC AGA CAT GCT TGC AAG TTC 10186 
Lys Asp lie Leu Leu Cys Pro Glu Glu lie Arg His Ala Cys Lys Phe 

1575 1580 ~ 1585 

GGG ATT GCT AAG GAT AAT AAT AAA GAC ATG AGC TAT CCC CCT TGG GGA 10234 
Gly lie Ala Lys Asp Asn Asn Lys Asp Met Ser Tyr Pro Pro Trp Gly 

1590 " 1595 ' 1600 

AGG GAA TCC AGA GGG ACA ATT ACA ACA ATC CCT GTT TAT TAT ACG ACC • 10282 
Arg Glu Ser Arg Gly Thr lie Thr Thr lie Pro Val Tyr Tyr Thr Thr 
1605 1610 1615 

ACC CCT TAC CCA AAG ATG CTA GAG ATG CCT CCA AGA ATC CAA AAT CCC 10330 
Thr Pro Tyr Pro Lys Met Leu Glu Met Pro Pro Arg lie Gin Asn Pro 
1620 1625 1630 

CTG CTG TCC GGA ATC AGG TTG GGC CAA TTA CCA ACT GGC GCT CAT TAT 10378 
Leu Leu Ser Gly lie Arg Leu Gly Gin Leu Pro Thr Gly Ala His Tyr 
1635 1640 1645 " 1650 

AAA ATT CGG AGT ATA TTA CAT GGA ATG GGA ATC CAT TAC AGG GAC TTC 10426 
Lys lie Arg Ser lie Leu His Gly Met Gly lie His Tyr Arg Asp Phe 

1655 1660 " " 1665 

TTG AGT TGT GGA GAC GGC TCC GGA GGG ATG ACT GCT GCA TTA CTA CGA 104 74 

Leu Ser Cys Gly Asp Gly Ser Gly Gly Met Thr Ala Ala Leu Leu Arg 

1670 1675 1680 

GAA AAT GTG CAT AGC AGA GGA ATA TTC AAT AGT CTG TTA GAA TTA TCA 10522 
Glu Asn Val His Ser Arg Gly lie Phe Asn Ser Leu Leu Glu Leu Ser 
1685 1690 1695 

GGG TCA GTC ATG CGA GGC GCC TCT CCT GAG CCC CCC AGT GCC CTA GAA 10570 
Gly Ser Val Met Arg Gly A3 a Ser Pxu ^lu Pro Pro Ser Ala Leu Glu 
1700 IV 05 1710 

ACT TTA GGA GGA GAT AAA TCG AGA TGT GTA AAT GGT GAA ACA TGT TGG 10618 
Thr Leu Gly Gly Asp Lys Ser Arg Cys Val Asn Gly Glu Thr Cys Trp 
1715 1720 1725 " 1730 

GAA TAT CCA TCT GAC TTA TGT GAC CCA AGG ACT TGG GAC TAT TTC CTC 10666 
Glu Tyr Pro Ser Asp Leu Cys Asp Pro Arg Thr Trp Asp Tyr Phe Leu 

1735 ~ 1740 " ' 1745 . 

CGA CTC AAA GCA GGC TTG GGG CTT CAA ATT GAT TTA ATT GTA ATG GAT 10714 
Arg Leu. Lys Ala Gly Leu Gly Leu Gin lie Asp Leu lie Val Met Asp 

1750 1755 1760 

ATG GAA GTT CGG GAT TCT TCT ACT AGC CTG AAA ATT GAG ACG AAT GTT 10762 
Met Glu Val Arg Asp Ser Ser Thr Ser Leu Lys lie Glu Thr Asn Val 
1765 1770 1775 

AGA AAT TAT GTG CAC CGG ATT TTG GAT GAG CAA GGA GTT TTA ATC TAC 10810 
Arg Asn Tyr Val His Arg lie Leu Asp Glu Gin Gly Val Leu lie Tyr 
1780 1785 * 1790 

AAG ACT TAT GGA ACA TAT ATT TGT GAG AGC GAA AAG AAT GCA GTA ACA 10858 
Lys Thr Tyr Gly Thr Tyr lie Cys Glu Ser Glu Lys Asn Ala Val Thr 
1795 1800 1805 1810 

ATC CTT GGT CCC ATG TTC AAG ACG GTC GAC TTA GTT CAA ACA GAA TTT 109 06 

lie Leu Gly Pro Met Phe Lys Thr Val Asp Leu Val Gin Thr Glu Phe 

1815 1820 1825 

AGT AGT TCT CAA ACG TCT GAA GTA TAT ATG GTA TGT AAA GGT TTG AAG 10954 
Ser Ser Ser Gin Thr Ser Glu Val Tyr Met Val Cys Lys Gly Leu Lys 

183 0 183 5 ' 184 0 
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AAA TTA ATC GAT GAA CCC AAT CCC GAT TGG TCT TCC ATC AAT GAA TCC 
Lys Leu lie Asp Glu Pro Asn Pro Asp Trp Ser Ser lie Asn Glu Ser 
1845 1850 1855 

TGG AAA AAC CTG TAC GCA TTC CAG TCA TCA GAA CAG GAA TTT GCC AGA 
Trp Lys Asn Leu Tyr Ala Phe Gin Ser Ser Glu Gin Glu Phe Ala Arg 
I860 1865 1870 

GCA AAG AAG GTT AGT ACA TAC TTT ACC TTG ACA GGT ATT CCC TCC CAA 
Ala Lys Lys Val Ser Thr Tyr Phe Thr Leu Thr Gly He Pro Ser Gin 
1875 1880 1885 " 1890 

TTC ATT CCT GAT CCT TTT GTA AAC ATT GAG ACT ATG CTA CAA ATA TTC 
Phe He Pro Asp Pro Phe Val Asn He Glu Thr Met Leu Gin He Phe 

1895 1900 1905 

GGA GTA CCC ACG GGT GTG TCT CAT GCG GCT GCC TTA AAA TCA TCT GAT 
Gly Val Pro Thr Gly Val Ser His Ala Ala Ala Leu Lys Ser Ser Asp 

1910 1915 " 1920 

AGA CCT GCA GAT TTA TTG ACC ATT AGC CTT TTT TAT ATG GCG ATT ATA 
Arg Pro Ala Asp Leu Leu Thr lie Ser Leu Phe Tyr Met Ala lie lie 
1925 1930 " 1935 

. TCG TAT TAT AAC ATC AAT CAT ATC AGA GTA GGA CCG ATA CCT CCG AAC 
Ser Tyr Tyr Asn He Asn His He Arg Val Gly Pro lie Pro Pro Asn 
1940 1945 1950 

CCC CCA TCA GAT GGA ATT GCA CAA AAT GTG GGG ATC GCT ATA ACT GGT 
Pro Pro Ser Asp Gly He Ala Gin Asn Val Gly He Ala lie Thr Gly 
1955 i960 1965 1970 

ATA AGC TTT TGG CTG AGT TTG ATG GAG AAA GAC ATT CCA CTA TAT CAA 
^ He Ser Phe Trp Leu Ser Leu Met Glu Lys Asp He Pro Leu Tyr Gin 

1975 1980 1985 

CAG TGT TTA GCA GTT ATC CAG CAA TCA TTC CCG ATT AGG TGG GAG GCT 
Gin Cys Leu Ala Val lie Gin Gin Ser Phe Pro lie Arg Trp Glu Ala 

1990 1995 2000 

GTT TCA GTA AAA GGA GGA TAC AAG CAG AAG TGG AGT ACT AGA GGT GAT 
Val Ser Val Lys Gly Gly Tyr Lys Gin Lys Trp Ser Thr Arg Gly Asp 
2005 2010 2015 

GGG CTC CCA AAA GAT ACC CGA ACT TCA GAC TCC TTG GCC CCA ATC GGG 
Gly Leu Pro Lys Asp Thr Arg Thr Ser Asp Ser Leu Ala Pro He Gly 
2020 2025 2030 

AAC TGG ATC AGA TCT CTG GAA TTG GTC CGA AAC CAA GTT CGT CTA AAT 
Asn Trp lie Arg Ser Leu Glu Leu Val Arg Asn Gin Val Arg Leu Asn 
2035 2040 2045 " 2050 

CCA TTC AAT GAG ATC TTG TTC AAT CAG CTA TGT CGT ACA GTG GAT AAT 
Pro Phe Asn Glu lie Leu Phe Asn Gin Leu Cys Arg Thr Val Asp Asn 

2055 2060 2065 

CAT TTG AAA TGG TCA AAT TTG CGA AGA AAC ACA GGA ATG ATT GAA TGG 
His Leu Lys Trp Ser Asn Leu Arg Arg Asn Thr Gly Met lie Glu Trp 

2070 2075 ~ 2080 

ATC AAT AGA CGA ATT TCA AAA GAA GAC CGG TCT ATA CTG ATG TTG AAG 
lie Asn Arg Arg lie Ser Lys Glu Asp Arg Ser lie Leu Met Leu Lys 
2085 2090 2095 

AGT GAC CTA CAC GAG GAA AAC TCT TGG AGA GAT TAA AAAATCATGA 
Ser Asp Leu His Glu Glu Asn Ser Trp Arg Asp 
2100 2105 



11002 



11050 



11098 



11146 



11134 



11242 



11290 



11338 



11386 



11434 



11482 



11530 



11578 



11526 



11674 



11722 



11768 
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GGAGACTCCA 


• AACTTTAAGT ATGAAAAAAA 


CTTTGATCCT 


TAAGACCCTC 


TTGTGGTTTT 


11828 


TATTTTTTAT 


' CTGGTTTTGT GGTCTTCGTG 


GGTCGGCATG 


GCATCTCCAC 


CTCCTCGCGG 


11888 


TCCGACCTGG 


GCATCCGAAG GAGGACGTCG 


TCCACTCGGA 


TGGCTAAGGG 


AGGGGCCCCC 


11948 


GCGGGGCTGC 


TAACAAAGCC CGAAAGGAAG 


CTGAGTTGGC 


TGCTGCCACC 


GCTGAGCAAT 


. 12008 


AACTAGCATA 


ACCCCTTGGG GCCTCTAAAC 


GGGTCTTGAG 

V 


• GGGTTTTTTG 


CTGAAAGGAG 


12068 


GAACTATATC 


CGGATCGAGA CCTCGATACT AGTGCGGTGG AGCTCCAGCT 


TTTGTTCCCT 


12128 


TTAGTGAGGG 


TTAATTTCGA GCTTGGCGTA 


ATCATGGTCA 


TAGCTGTTTC 


CTGTGTGAAA 


12188 


TTGTTATCCG 


CTCACAATTC CACACAACAT 


ACGAGCCGGA 


AGCATAAAGT 


GTAAAGCCTG 


12248 


GGGTGCCTAA 


TGAGTGAGCT AACTCACATT 


AATTG CGTTG 


CGCTCACTGC 


CCGCTTTCCA 


12308 


GTCGGGAAAC 


CTGTCGTGCC AG CT G C ATT A 


ATGAATCGGC 


CAACGCGCGG 


GGAGAGGCGG 


12368 


TTTGCGTATT 


GGGCGCTCTT CCGCTTCCTC 


GCTCACTGAC 


TCGCTGCGCT 


CGGTCGTTCG 


12428 


GCTGCGGCGA 


GCGGTATCAG CTCACTCAAA 


GGCGGTAATA 


CGGTTATCCA 


CAGAATCAGG 


12488 


GGATAACGCA 


GGAAAGAACA TGTGAGCAAA 


AGGCCAGCAA 


AAGGCCAGGA 


ACCGTAAAAA 


12548 


GGCCGCGTTG 


CTGGCGTTTT TCCATAGGCT 


CCGCCCCCCT 


GACGAGCATC 


ACAAAAATCG 


12608 


ACGCTCAAGT 


CAGAGGTGGC GAAACCCGAC 


AGGACTATAA 


AGATACCAGG 


CGTTTCCCCC 


12668 


TGGAAGCTCC 


CTCGTGCGCT CTCCTGTTCC 


GACCCTGCCG 


CTTACCGGAT ACCTGTCCGC 


12728 


CTTTCTCCCT 


TCGGGAAGCG TGGCGCTTTC 


TCATAGCTCA 


CGCTGTAGGT 


ATCTCAGTTC 


12788 


GGTGTAGGTC 


GTTCGCTCCA AGCTGGGCTG 


TGTGCACGAA 


CCCCCCGTTC 


AGCCCGACCG 


■ 1254 8 


CTGCGCCTTA 


TCCGGTAACT ATCGTCTTGA 


GTCCAACCCG 


GTAAGACACG 


ACTTATCGCC 


12908 


ACTGGCAGCA 


GCCACTGGTA ACAGGATTAG 


CAGAGCGAGG 


TATGTAGGCG 


GTGCTACAGA 


12968 


GTTCTTGAAG 


TGGTGGCCTA ACTACGGCTA 


CACTAGAAGG 


ACAGTATTTG 


GTATCTGCGC 


13028 


TCTGCTGAAG 


CCAGTTACCT TCGGAAAAAG 


AGTTGGTAGC 


TCTTGATCCG 


GCAAACAAAC 


13088 


GACCGCTGGT 


AGCGGTGGTT TTTTTG TTTG 


CAAGCAGCAG 


ATTACGCGCA 


GAAAAAAAGG 


13148 


ATCTCAAGAA 


GATCCTTTGA TCTTTTCTAC 


GGGGTCTGAC 


GCTCAGTGGA 


ACGAAAACTC 


13208 


ACGTTAAGGG 


ATTTTGGTCA TGAGATTATC 


AAAAAGGATC 


TTCACCTAGA 


TCCTTTTAAA 


13268 


TTAAAAATGA 


AGTTTTAAAT CAATCTAAAG 


TATATATGAG 


TAAACTTGGT 


CTGACAGTTA 


13328 


CCAATGCTTA 


ATCAGTGAGG CACCTATCTC 


AGCGATCTGT 


CTATTTCGTT 


CATCCATAGT 


13388 


TGCCTGACTC 


CCCGTCGTGT AGATAACTAC 


GATACGGGAG 


GGCTTACCAT 


CTGGCCCCAG 


13448 


TGCTGCAATG 


ATAC CGCGAG ACCCACGCTC 


ACCGGCTCCA 


GATTTATCAG 


CAATAAACCA 


13508 


GCCAGCCGGA 


AGGGCCGAGC- GC AG AAGTGG 


TCCTGCAACT 


TTATCCGCCT 


CCATCCAGTC 


13568 


TATTAATTGT 


TGCCGGGAAG CTAGAGTAAG 


TAGTTCGCCA 


GTTAATAGTT 


TGCGCAACGT 


13628 


TGTTGCCATT 


GCTACAGGCA TCGTGGTGTC 


ACGCTCGTCG 


TTTGGTATGG 


CTTCATTCAG 


13688 


CTCCGGTTCC 


CAACGATCAA GGCGAGTTAC 


ATGATCCCCC 


ATGTTGTGCA 


AAAAAGCGGT 


13748 


TAGCTCCTTC 


GGTCCTCCGA TCGTTGTCAG 


AAGTAAGTTG 


GCCGCAGTGT 


TATCACTCAT 


13808 
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GGTTATGGCA GCACTGCATA ATTCTCTTAC 
GACTGGTGAG TACTCAACCA AGTCATTCTG 



TGTCATGCCA TCCGTAAGAT GCTTTTCTGT 
AGAATAGTG T ATGCGGCGAC CGAGTTGCTC 



TTGCCCGGCG TCAATACGGG ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT 
CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC TTACCGCTGT TGAGATCCAG 

ATCTTCAGCA TCTTTTACTT TCACCAGCGT 
TGCCGCAAAA' AAGGGAATAA GGGCGACACG 
GAAATGTTGA ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA TAGGGGTTCC 
GCG CACATTT CCCCGAAAAG TGC 



TTCGATGTAA CCCACTCGTG CACCCAACTG 

s 

TTCTGGGTGA GCAAAAACAG GAAGGCAAAA 



13868 

13928 

13988 

14048 

14108 

14168 

14228 

14288 

14311 



(2) INFORMATION FOR SEQ ID NO; 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

, (Xi) SEQUENCE DESCRIPTION SEQ ID NO:2: 

Met Ser Val Thr Val Lys Arg He He Asp Asn Thr Val He Val Pro 
1 5 10 15 

T-vs Leu Pro Ala Asn Glu Asp Pr-u val Tyr Pro Ala Asp Tyr Phe 

20 2Z ■ 30 

Arg Lys Ser Lys Glu He Pro Leu Tyr He Asn Thr Thr Lys Ser Leu 
35 40 45 

Ser Asp Leu Arg Gly Tyr Val Tyr Gin Gly Leu Lys Ser Gly Asn Val 
50 .55 60 

Ser He lie His Val Asn Ser Tyr Leu Tyr Gly Ala Leu Lys Asp He 
65 70 75 80 

Arg Gly Lys Leu Asp Lys Asp Trp Ser Ser Phe Gly lie Asn He Gly 

85 - 90 95 

Lys Ala Gly Asp Thr lie Gly He Phe Asp Leu Val Ser Leu Lys Ala 

100 105 no 

Leu Asp Gly Val Leu Pro Asp Gly Val Ser Asp Ala Ser Arg Thr Ser 
115 120 125 

Ala Asp Asp Lys Trp Leu Pro Leu Tyr Leu Leu Gly Leu Tyr Arg Val 
130 135 14 0 - - 

Gly Arg Thr Gin Met Pro Glu Tyr Arg Lys Lys Leu Met Asp Gly Leu 
145 150 155 i 60 

Thr Asn Gin Cys Lys Met He Asn Glu Gin Phe Glu Pro Leu Val Pro 

. 165 170 175 

Glu Gly Arg Asp He Phe Asp Val Trp Gly Asn Asp Ser Asn Tyr Thr 

18 0 185 iso 

Lys He Val Ala Ala Val Asp Met Phe Phe His Met Phe Lys Lys His 



- 88 - 



BNSDOCID: <WO _9634625A1JA> 



WO 96/34625 PCT/US96/06053 



195 200 



205 



Glu Cys Ala Ser Phe Arg Tyr Gly Thr lie Val Ser Arg Phe Lys Asp 

215 220 

Cys Ala Ala Leu Ala Thr Phe Gly His Leu Cys Lys lie Thr Gly Met 

230 23S 240 

Ser Thr Glu Asp Val Thr Thr Trp lie Leu Asn Arg Glu Val Ala Asp 

245 . 250 255 

Glu Met Val Gin Met Met Leu Pro Gly Gin Glu lie Asp Lys Ala Asp 

265 270 

Ser Tyr Met Pro Tyr Leu lie Asp Phe Gly Leu Ser Ser Lys Ser Pro 
* ' 3 280 



285 



Tyr Ser Ser Val Lys Asn Pro Ala Phe His Phe Trp Gly Gin Leu Thr 

2 55 300 

Ala Leu Leu Leu Arg Ser Thr Arg Ala Arg Asn Ala Arg Gin Pro Asp 

31 5 320 

Asp lie Glu Tyr Thr Ser Leu Thr Thr Ala Gly Leu Leu Tyr Ala Tyr 

325 330 335 

Ala val Gly ser Ser Ala Asp Leu Ala Gin Gin Phe Cys Val Gly Asp 

345 3 50 

Asn Lys Tyr Thr Pro Asp Asp Ser Thr Gly Gly Leu Thr Thr Asn Ala 
Ji,i> 360 



365 



Pro Pro Gin Gly Arg Asp Val Val Glu Trp Leu Gly Trp Phe Glu Asp 

. -'=> ■ _ . 380 

Gin Asn Arg Lys Pro Thr Pro Asp Met Met Gin Tyr Ala Lys Arg Ala 

390 395 * 400 

Val Met Ser Leu Gin Gly Leu Arg Glu Lys Thr lie Gly Lys Tyr Ala 

405 410 415 

Lys Ser Glu Phe Asp Lys 

420 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Asp Asn Leu Thr Lys Val Arg Glu Tyr Leu Lys Ser Tyr Ser Arg 

1° 15 

Leu Asp Gin Ala Val Gly Glu lie Asp Glu lie Glu Ala Gin Arg Ala 

20 25 3 0 

Glu Lys Ser Asn Tyr Glu Leu Phe Gin Glu Asp Gly Val Glu Glu His 
35 40 45 

Thr Lys Pro Ser Tyr Phe Gin Ala Ala Asp Asp Ser Asp Thr Glu Ser 

55 60 
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Glu Pro Glu lie Blu Asp Asn Gin Gly Leu Tyr Ala Pro Asp Pro Glu 

75 BO 

Ala Glu Gin Val Glu Glv Phe Hp n « n„ r> T 

^xu ^xy yne lie Gin Gly Pro Leu Asp Asp Tyr Ala 

85 90 95 

Asp" Glu Glu Val Asp Val Val Phe Thr Ser Asp Trp Lys Gin Pro Glu 

105 



110 



Leu Glu Ser Asp Glu His Gly Lys Thr Leu Arg Leu Thr Ser Pro Glu 



120 



125 



Gly Leu Ser Gly Glu Gin Lys Ser Gin Trp Leu Ser Thr lie Lys Ala 



"5 140 



Val val Gin Ser Ala Lys Tyr Trp Asn Leu Ala Glu Cys Thr Phe Glu 

150 15 5 160 

Ala Ser Gly Glu Gly Val lie Met Lys Glu Arg Gin He Thr Pro Asp 

"5 170 175 P 

Val Tyr Lys Val Thr Pro Val Met Asr. Thr His Pro Ser Gin Ser Glu ' 



185 



190 



Ala Val Ser Asp Val Trp Ser Leu Ser Lys Thr Ser Met Thr Phe Gin 

200 205 

Pro Lys Lys Ala Ser Leu Gin Pro Leu Thr lie Ser Leu Asp Glu Leu 

215 220 

Phe Ser Ser Arg Gly Glu Phe lie Ser Val Gly Gly Asp Gly Arg Met 

235 240 
S^ VAs Lys Glu Ala He Leu Leu G), Leu Ar, Tyr Lys Lys Leu Tyr 

255 

Asn Gin Ala Arg Val Lys Tyr Ser - Leu 

260 265 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ser Ser Leu Lys Lys lie Leu Giy Leu Lys Gly Lys Gly Lys Lys 

■ 10 15 

Ser Lys Lys Leu Gly lie Ala Pro Pro Pro Tyr Glu Glu Asp Thr Ser 

3 0 

Met Glu Tyr Ala Pro Ser Ala Pro lie Asp Lys Ser Tyr Phe Gly Val 

40 45 

Asp Glu Met Asp Thr Tyr Asp Pro Asn Gin Leu Arg Tyr Glu Lys Phe 

55 60 

Phe Phe Thr Val Lys Met Thr Val Arg Ser Asn Arg Pro Phe Arg Thr 

75 80 
Tyr Ser Asp Val Ala Ala Ala Val Ser His Trp Asp His Met Tyr He 
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Gly Met Ala Gly Lys Arg Pro Phe Tyr Lys lie Leu A ia Phe Leu Gly 



110 

Ser ser Asn Leu Lys Ala Thr Pro Ala val Leu Ala Asp Gin Gly Gin 

120 - 125 



Pro Glu Tyr His Thr His Cys Glu Gly Arg Ala Tyr Leu Pro His Arg 

135 140 a 

Met Gly Lys Thr Pro Pro Met Leu Asn Val Pro Glu His Phe Arg Arg 

155 160 

Pro Phe Asn lie Gly Leu Tyr Lys Gly Thr lie Glu Leu Thr Met Thr 

165 170 



He Tyr Asp Asp Glu Ser Leu Glu Ala Ala Pro Met lie Trp Asp His 

j <_ 185 



175 

190 



Phe Asn Ser Ser Lys Phe Ser Asp Phe Arg Glu Lys Ala Leu Met Phe 



205 



Gly Leu lie Val Glu Lys Lys Ala Ser Gly Ala Trp Val Leu Asp Ser 

215 220 

He Ser His Phe Lys 
225 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 511 amino ac^ds " 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Lys Cys Leu Leu Tyr Leu Ala Phe Leu Phe He Gly Val Asn Cys 

- 5 10 15 

Lys Phe Thr lie Val Phe Pro His Asn Gin Lys Gly Asn Trp Lys Asn 

25 30 
Val Pro Ser Asn Tyr His Tyr Cys Pro Ser Ser Ser Asp Leu Asn Trp 

His Asn Asp Leu lie Gly Thr Ala lie Gin Val Lys Met Pro Lys Ser 



His Lys Ala lie Gin Ala Asp Gly Trp Met Cys His Ala Ser Lys Trp 

70 7 5 80 

Val Thr Thr Cys Asp Phe Arg Trp Tyr Gly Pro Lys Tyr lie Thr Gin 

85 9 ° 95 

Ser lie Arg Ser Phe Thr Pro Ser Val Glu Gin Cys Lys Glu Ser He 



Glu Gin Thr Lys Gin Gly Thr Trp Leu Asn Pro Gly Phe Pro Pro Gin 

120 i2S 



Ser Cys Gly Tyr Ala Thr Val Thr Asp Ala Glu Ala Val He Val Gin 



"5 140 
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Val Thr Pro His His Val Leu Val Asp Glu Tyr Thr Gly Glu Trp Val 
145 150 155 160 

Asp Ser Gin Phe lie Asn Gly Lys Cys Ser Asn Tyr He Cys Pro Thr 

165 170 175 

Val His Asn Ser Thr Thr Trp His Ser Asp Tyr Lys Val Lys Gly Leu 

18° . 185 190 

Cys Asp Ser Asn Leu He Ser Met Asp He Thr Phe Phe Ser Glu Asp 
195 200 205 

Gly Glu Leu Ser Ser Leu Gly Lys Glu Gly Thr Gly Phe Arg Ser Asn 
210 215 220 

Tyr Phe Ala Tyr Glu Thr Gly Gly Lys Ala Cys Lys Met Gin Tyr Cys 
225 230 235 240 

Lys His Trp Gly Val Arg Leu Pro Ser Gly Val Trp Phe Glu Met Ala 

245 250 255 

Asp Lys Asp Leu Phe Ala Ala Ala Arg Phe Pro Glu Cys Pro Glu Glv 

260 265 270 

Ser Ser He Ser Ala Pro Ser Gin Thr Ser Val Asp Val Ser Leu He 
275. 280 285 

Gin Asp Val Glu Arg He Leu Asp Tyr Ser Leu Cys Gin Glu Thr Trp 
290 295 300 

Ser Lys lie Arg Ala Gly Leu Pro He Ser Pro Val Asp Leu Ser Tyr 

^ 315 320 

:.eu Air :.-o Lys Asn Pro Gly Thr Gly Vr j Ala Phe ~hr He He Asn' 

325 3-jo 335 . 

Gly Thr Leu Lys Tyr Phe Glu Thr Arg Tyr He Arg Val Asp He Ala 

340 345 350 

Ala Pro He Leu Ser Arg Met Val Gly Met He Ser Gly Thr Thr Thr 
355 360 365 

Glu Arg Glu Leu Trp Asp Asp Trp Ala Pro Tyr Glu Asp Val Glu He 
0 375 380 

Gly Pro Asn Gly Val Leu Arg Thr Ser Ser Gly Tyr Lys Phe Pro Leu 
385 390 395 400 

Tyr Met He Gly His Gly Met Leu Asp Ser Asp Leu His Leu Ser Ser 

4 05 410 415 

Lys Ala Gin Val Phe Glu His Pro His He Gin Asp Ala Ala Ser Gin 

420 425 4 3o 

Leu Pro Asp Asp Glu Ser Leu Phe Phe Gly Asp Thr Gly Leu Se- Lvs 
435 440 " 445 ~ y 

Asn Pro lie Glu Leu Val Glu Gly Trp Phe Ser Ser Trp Lys Ser Ser 
450 455 460 

He Ala Ser Phe Phe Phe He He Gly Leu He He Gly Leu Phe Leu 
65 470 475 480 

Val Leu Arg Val Gly He His Leu Cys He Lys Leu Lys His Thr Lys 

485 490 495 

Lys Arg Gin lie Tyr Thr Asp He Glu Met Asn Arg Leu Gly Lys 
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500 505 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2109 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



510 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Met Glu Val His Asp Phe Glu Thr Asp Glu Phe Asn Asp Phe Asn Glu 
1 5 io 



15 



Asp Asp Tyr Ala Thr Arg Glu Phe Leu Asn Pro Asp Glu Arg Met Thr 

20 25 30 

Tyr Leu Asn His Ala Asp Tyr Asn Leu Asn Ser Pro Leu He Ser Asd 
35 40 45 * 

Asp He Asp Asn Leu He Arg Lys Phe Asn Ser Leu Pro He Pro Ser 
50 55 60 

Met Trp Asp Ser Lys Asn Trp Asp Gly Val Leu Glu Met Leu Thr Ser 
65 70 75 80 

Cys Gin Ala Asn Pro He Ser Thr Ser Gin Met His Lys Trp Met Gly 

85 90 '95 

Ser Trp Leu Met Ser Asp Asn His Asp Ala Ser Cin Glv Tyr Ser Phe 

100 105 ' 110 

Leu His Glu Val Asp Lys Glu Ala Glu He Thr Phe Asp Val Val Glu 
115 120 125 

Thr Phe He Arg Gly Trp Gly Asn Lys Pro He Glu Tyr He Lys Lvs 
I 30 135 i 4 o Y Y 

Glu Arg Trp Thr Asp Ser Phe Lys He Leu Ala Tyr Leu Cys Gin Lys 
145 150 155 i6o 

Phe Leu Asp Leu His Lys Leu Thr Leu He Leu Asn Ala Val Ser Glu 

165 170 i7 5 

Val Glu Leu Leu Asn Leu Ala Arg Thr Phe Lys Gly Lys Val Arg Arq 

180 185 * 190 

Ser Ser His Gly Thr Asn He Cys Arg He Arg Val Pro Ser Leu Glv 
195 200 " 205 

Pro Thr Phe He Ser Glu Gly Trp Ala Tyr Phe Lys Lys Leu Asp He 
210 215 220 

Leu Met Asp Arg Asn Phe Leu Leu Met Val Lys Asp Val He He Glv 
225 230 235 240 

Arg Met Gin Thr Val Leu Ser Met Val Cys Arg He Asp Asn Leu Phe 

245 250 * 255 

Ser Glu Gin Asp He Phe Ser Leu Leu Asn lie Tyr Arg lie Gly Asp 

260 265 * 270 

Lys lie Val Glu Arg Gin Gly Asn Phe Ser Tyr Asp Leu lie Lys Met 
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275 



280 



285 



Val Glu Pro lie Cys Asn Leu Lys Leu Met Lys Leu Ala Arg Glu Ser 

295 300 

Arg Pro Leu Val Pro Gin Phe Pro His Phe Glu Asn His lie Lys Thr 

315 320 

Ser val Asp Glu Gly Ala Lys lie Asp Arg Gly lie Arg Phe Leu His 

325 .330 .335 

Asp Gin lie Met Ser Val Lys Thr Val Asp Leu Thr Leu Val lie Tyr 

345 

Gly ser Phe Arg His Trp Gly His Pro Phe lie Asp Tyr Tyr Thr Gly 

365 

Leu Glu Lys Leu His Ser Gin Val Thr Met Lys Lys Asp He Asp Val 

375 380 

Ser Tyr Ala Lys Ala Leu Ala Ser Asp Leu Ala Arg lie Val Leu Phe 

390 3S5 4Q0 

Gin Gin Phe Asn Asp His Lys Lys Trp Phe Val Asn Gly Asp Leu Leu 

1 5 

Pro His Asp His Pro Phe Lys Ser His Val Lys Glu Asn Thr Trp Pro 

425 430 

Thr Ala Ala Gin Val Gin Asp Phe Gly Asp Lys Trp His Glu Leu Pro 

440 445 

Leu lie Lys Cys Phe Glu lie Pro Asp Leu Leu Asp Pro Ser lie He 

Tyr ser Asp Lys Ser His Ser Met Asn Arg Ser Glu Val Leu Lys His 

475 480 
Val Arg Met Asn Pro Asn Thr Pro lie Pro Ser Lys Lys Val Leu Gin 

485 490 495 

Thr Met Leu Asp Thr Lys Ala Thr Asn Trp Lys Glu Phe Leu Lys Glu 

He Asp Glu Lys Gly Leu Asp Asp Asp Asp Leu lie He Gly Leu Lys 

; 52 0 525 

Gly Lys Glu Arg Glu Leu Lys Leu Ala Gly Arg Phe Phe Ser Leu Met 

535 540 

Ser Trp Lys Leu Arg Glu Tyr Phe Val He Thr Glu Tyr Leu He Lys 

550 555 560 

Thr His Phe Val Pro Met Phe Lys Gly Leu Thr Met Ala Asp Asp Leu 

555 57 ° 575 

Thr Ala Val lie Lys Lys Met Leu Asp Ser Ser Ser Gly Gin Gly Leu 

585 590 . . 

Lys ser Tyr Glu Ala He Cys lie Ala Asn His He Asp Tyr Glu Lys 

Trp Asn Asn His Gin Arg Lys Leu Ser Asn Gly Pro Val Phe Arg Val 

615 620 
Met Gly G m Phe Leu Gl Tyr pro ser ^ ne ^ t ^ 

630 "5 6 4 0 
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Phe Phe Glu Lys Ser Leu lie Tyr Tyr Asn Gly Arg Pro Asp Leu Met 

645 650 " 655 

Arg Val His Asn Asn Thr Leu He Asn Ser Thr Ser Gin Arg Val Cvs 

660 665 6 7o 

Trp Gin Gly Gin Glu Gly Gly Leu Glu Gly Leu Arg Gin Lys Gly Tro 
675 680 685 

' Thr lit LeU ASn L6U Leu Val X ^ e 01x1 Ar 9 Glu Ala Lys He Arg Asn 
690 695 700 

Thr Ala Val Lys Val Leu Ala Gin Gly Asp Asn Gin Val He Cys Thr 
705 710 715 7 2 o 

Gin Tyr Lys Thr Lys Lys Ser Arg Asn Val Val Glu Leu Gin Gly Ala 

725 730 735 

Leu Asn Gin Met Val Ser Asn Asn Glu Lys He Met Thr Ala He Lys 

740 745 750 Y 

He Gly Thr Gly Lys Leu Gly Leu Leu He Asn Asp Asp Glu Thr Met 
755 760 765 

Gin Ser Ala Asp Tyr Leu Asn Tyr Gly Lys He Pro lie Phe Arg Gly 

Val He Arg Gly Leu Glu Thr Lys Arg Trp Ser Arg Val Thr Cys Val 
785 790 795 eoo 

Thr Asn Asp Gin lie Pro Thr Cys Ala Asn He Met Ser Ser Val Ser 

805 810 8 15 

Thr Asn Ala - Leu Thr Val Ala His Phe Ma Glcv Asn Pvu He Asn Ala 

820 825 " 8 3o 

Met He Gin Tyr Asn Tyr Phe Gly Thr Phe Ala Arg Leu Leu Leu Met 
835 840 84 5 

Met His Asp Pro Ala Leu Arg Gin Ser Leu Tyr Glu Val Gin Asp Lys 
850 . 855 8 60 

He Pro Gly Leu His Ser Ser Thr Phe Lys Tyr Ala Met Leu Tyr Leu 
865 870 875 880 

Asp Pro Ser lie Gly Gly Val Ser Gly Met Ser Leu Ser Arg Phe Leu 

885 890 89 5 

He Arg Ala Phe Pro Asp Pro Val Thr Glu Ser Leu Ser Phe Trp Arc? 

900 Qf>5 r a 



910 



Phe He His Val His Ala Arg Ser Glu His Leu Lys Glu Met Ser Ala 
915 920 9 25 

Val Phe Gly Asn Pro Glu lie Ala Lys Phe Arg lie Thr His He Asp 

Lys Leu Val Glu Asp Pro Thr Ser Leu Asn He Ala Met Gly Met Ser 
945 950 . 955 seo 

Pro Ala Asn Leu Leu Lys Thr Glu Val Lys Lys Cys Leu lie Glu Ser 

965 970 * 975 

* 

Arg Gin Thr He Arg Asn Gin Val lie Lys Asp Ala Thr He Tyr Leu 

980 985 * 99 0 * 

Tyr His Glu Glu Asp Arg Leu Arg Ser Phe Leu Trp Ser lie Asn Pro 
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995 1000 1005 

Leu Phe Pro Arg Phe Leu Ser Glu Phe Lys Ser Gly Thr Phe Leu Gly 
1010 1015 " 1020 

Val Ala Asp Gly Leu He Ser Leu Phe Gin Asn Ser Arg Thr He Arg 
1025 1030 1035 " 1040 ■'" 

Asn Ser Phe Lys Lys Lys Tyr His Arg Glu Leu Asp Asp Leu He Val 

* 1045 1050 ~ 1055 

Arg Ser Glu Val Ser Ser Leu Thr His Leu Gly Lys Leu His Leu Arg 

1060 1065 * 1070 

Arg Gly Ser Cys Lys Met Trp Thr Cys Ser Ala Thr His Ala Asp Thr 
1075 1080 1085 

Leu Arg Tyr Lys Ser Trp Gly Arg Thr Val He Gly Thr Thr Val Pro 
1090 1095 1100 

His Pro Leu Glu Met Leu Gly Pro Gin His Arg Lys Glu Thr Pro Cys 
1105 llio ins . 1120 

Ala Pro Cys Asn Thr Ser Gly Phe Asn Tyr Val Ser Val His Cys Pro 

1125 1130 1135 

Asp Gly He His Asp Val Phe Ser Ser Arg Gly Pro Leu Pro Ala Tyr 

1140 1145 ~ 1150 

Leu Gly Ser Lys Thr Ser Glu Ser Thr Ser He Leu Gin Pro Trp Glu 
1155 1160 H65 

Arg Glu Ser Lys Val Pro Leu He Lys Arg Ala Thr Arg Leu Arg Asp 
H7C . . 1175 HC U 

Ala He Ser Trp Phe Val Glu Pro Asp Ser Lys Leu Ala Met Thr lie 
H85 ( 1190 1195 1200 

Leu Ser Asn lie His Ser Leu Thr Gly Glu Glu Trp Thr Lys Arg Gin 

1205 1210 1215 

His Gly Phe Lys Arg Thr Gly Ser Ala Leu His Arg Phe Ser Thr Ser 

1220 1225 ~ 1230 

Arg Met Ser His Gly Gly Phe Ala Ser Gin Ser Thr Ala Ala Leu Thr 
1235 1240 1245 

Arg Leu Met Ala Thr Thr Asp Thr Met Arg Asp Leu Gly Asp Gin Asn 
1250 1255 1260 

Phe Asp Phe Leu Phe Gin Ala Thr Leu Leu Tyr Ala Gin lie Thr Thr 
1265 1270 1275 1280 

Thr Val Ala Arg Asp Gly Trp lie Thr Ser Cys Thr Asp His Tyr His 

1285 1290 1295 

I1& Ala Cys Lys Ser Cys Leu Arg Pro He Glu Glu lie Thr Leu Asp 

1300 1305 1310 

Ser Ser Met Asp Tyr Thr Pro Pro Asp Val Ser His Val Leu Lys Thr 
1315 1320 1325 

Trp Arg Asn Gly Glu Gly Ser Trp Gly Gin Glu lie Lys Gin lie Tyr 
1330 1335 1340 

Pro Leu Glu Gly Asn Trp Lys Asn Leu Ala Pro Ala Glu Gin Ser Tyr 
13 45 1350 1355 1360 
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Gin Val Gly Arg Cys lie Gly Phe Leu Tyr Gly Asp Leu Ala Tyr Arg 

1365 1370 ~ 1375 

Lys Ser Thr His Ala Glu Asp Ser Ser Leu Phe Pro Leu Ser lie" Gin 

1380 1385 1390 

Gly Arg lie Arg Gly Arg Gly Phe Leu Lys Gly Leu Leu Asp Gly Leu 
1395 1400 1405 

Met Arg Ala Ser Cys Cys Gin Val' lie His Arg Arg Ser Leu Ala His , 
1410 s 1415 " 1420 

Leu Lys Arg Pro Ala Asn Ala Val Tyr Gly Gly Leu lie Tyr Leu lie 
1425 1430 ' 1435 1440 

Asp Lys Leu Ser Val Ser Pro Pro Phe Leu Ser Leu Thr Arg Ser Gly 

1445 1450 1455 

Pro lie Arg Asp Glu Leu Glu Thr lie Pro His Lys lie Pro Thr Ser 

1460 -1465 1470 

Tyr Pro Thr Ser Asn Arg Asp Met Gly Val lie Val Arg Asn Tyr Phe 

1475 1480 1485 

Lys Tyr Gin Cys Arg Leu lie Glu Lys Gly Lys Tyr Arg Ser His Tyr 
14 90 14 95 15 00 

Ser Gin Leu Trp Leu Phe Ser Asp Val Leu Ser lie Asp Phe lie Gly 
1505 1510 1515 * 1520 

Pro Phe Ser lie Ser Thr Thr Leu Leu Gin He Leu Tyr Lys Pro Phe 

1525 1530 * 1535 

Leu Ser Gly Lys Asp Lys Asn Glu Leu Arg Old -Leu. Ala ^^n Leu Ser 

1540 1545 1550 



Ser Leu Leu Arg Ser Gly Glu Gly Trp Glu Asp He His Val Lys Phe 
1555 1560 1565 

Phe Thr Lys Asp He Leu Leu Cys Pro Glu Glu He Arg His Ala Cys 
1570 1575 1580 

Lys Phe Gly He Ala Lys Asp Asn Asn Lys Asp Met Ser Tyr Pro Pro 
1585 1590 1595 1600 

Trp Gly Arg Glu Ser Arg Gly Thr He Thr Thr He Pro Val Tyr Tyr 

1605 1610 1615 

Thr Thr Thr Pro Tyr Pro Lys Met Leu Glu Met Pro Pro Arg He Gin 

1620 1625 1630 

Asn Pro Leu Leu Ser Gly He Arg Leu Gly Gin Leu Pro Thr Gly Ala 
1635 1640 1645 

His Tyr Lys lie Arg Ser He Leu His Gly Met Gly He His Tyr Arg 
1650 1655 1660 

Asp Phe Leu Ser Cys Gly Asp Gly Ser Gly Gly Met Thr Ala Ala Leu 
1665 1670 1675 1680 

Leu Arg Glu Asn Val His Ser Arg Gly He Phe Asn Ser Leu Leu Glu 

1685 1690 1695 

Leu Ser Gly Ser Val Met Arg Gly Ala Ser Pro Glu Pro Pro Ser Ala 

1700 1705 1710 

Leu Glu Thr Leu Gly Gly Asp Lys Ser Arg Cys Val Asn Gly Glu Thr 
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"15 1720 



1725 



Cys Trp^Glu Tyr Pro Ser Jjpta Cys Asp Pro Arg Thr Trp Asp Tyr 

Phe Leu Arg Leu Lys Ala Gly Leu Gly Leu Gin He Asp Leu lie Val 

1750 1755 1760 

Met Asp Met Glu Va^Arg Asp Ser Ser Thr Ser Leu Lys lie Glu Thr 

1775 



1765 1770 s 



Asn Val Arg Tyr Val His Arg lie Leu Asp Glu Gin Gly Val Leu 

1780 "85 1790 

lie Tyr Lys Thr Tyr Gly Thr Tyr Tie Cys Glu Ser Glu Lys Asn Ala 
1795 1800 . 1805 

IsiO 116 i^ 5 Phe Val i^ 0 LeU Val GlD Thr 

GluPhe Ser Ser Ser Gin Thr Ser Glu Val Tyr Met Val Cys Lys Gly 

. eJU 1835 1840 

Leu Lys Lys Leu lie Asp Glu Pro Asn Pro Asp Trp Ser Ser II 



1845 1850 



e Asn 



1855 



Glu Ser Trp Lys Asn Leu Tyr Ala Phe Gin Ser Ser Glu Gin Glu Phe 

1860 1865 1870 

Ala Arg Ala Lys Lys Val Ser Thr Tyr Phe Thr Leu Thr Gly lie Pro 
1875 1880 1885 

SSr ipg. Phe 116 Pr ° ASP ??° Phe Val Asn lie Glu Thr Met Leu Gin 

1895 1900 

lie Phe Gly Val Pro Thr Gly Val Ser His Ala Ala Ala Leu Lys Ser 

1910 "IS 1920 

Ser Asp Arg Pro Ala Asp Leu Leu Thr lie Ser Leu Phe Tyr Met Ala 

1925 1930 1935 

He lie Ser Tyr Tyr Asn He -Asn His lie Arg Val Gly Pro lie Pro 

1940 1945 1SS0 

Pro Asn Pro Pro Ser Asp Gly He Ala Gin Asn Val Gly lie Ala lie 

1960 1965 

Thr Gly ile Ser Phe Trp Leu Ser Leu Met Glu Lys Asp lie Pro Leu 

1975 1980 

Tyr^Gln Gin Cys Leu Ala^Val He Gin Gin Serine 'Pro lie Arg Trp 

Glu Ala Val Ser Val Lys Gly Gly Tyr Lys Gin Lys Trp Ser Thr Arg 

2005 2010 2015 

Gly Asp Gly Leu Pro Lys Asp Thr Arg Thr Ser Asp Ser Leu Al 



2020 2025 



a Pro 



2030 



He Gly AsnTrp He Arg Ser Leu Gi u Leu Val Arg Asn Gin Val Arg 



2045 



Leu Asn Pro Phe Asn Glu lie Leu Phe Asn Gin Leu Cys Arg Thr Val 

2055 2060 

Asp Asn His Leu Lys Trp Ser Asn Leu Arg Arg Asn Thr Gly Met Ile 

2070 2075 2080 
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Glu Trp lie Asn Arg Arg He Ser Lys Glu Asp Arg Ser He Leu Met 

2085 2090 2095 

Leu Lys Ser Asp Leu His Glu Glu Asn Ser Trp Arg Asp 

2100 2105 

{2} INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



bUil 1111 lb 


bubAAA I G 1 G 


GGGGGAACCG 


CTATTTGTTT 


ATTTTTCTAA 


ATACATTCAA 


60 


AiAiblAILL 


GL. 1 LATGAGA 


^ 7\ 7\ rri 71 71 « « suit 

C AATAAC C CT 


GATAAATGCT 


mm* Mm m, «■ if|« • m 

TCAATAATAT 


■■■ Mm mi m\ % M JHM IK 

TGAAAAAGGA 


120 


AG AG X A 1 GAG 


TAT 1 LAAC AT. 


TTCCGTGTCG 


C C CTTATTC C 


CTTTTTTGCG 


GCATTTTGCC 


180 


TTGC 1 G 1 i TT 


TGCTCACCCA 


G AAACG CTGG 


TGAAAGTAAA 


AGATGCTGAA 


GATCAGTTGG 


240 


GTGCACGAGT 


GGGTTACATC 


GAACTGGATC 


TCAACAGCGG 


TAAGATCCTT 


GAGAGTTTTC 


300 


GCCCCGAAGA 


ACGTTTTCCA 


ATGATGAGCA 


CTTTTAAAGT 


TCTGCTATGT 


GGCGCGGTAT 


36 0. 


TATCCCGTAT 


TGACGCCGGG 


CAAGAGCAAC 


TCGGTCGCCG 


CaTACACT VT 


TCTvCPGAATG 


420 


ACTTGGTTGA 


GTACTCACCA 


GTCACAGAAA 


AGCATCTTAC 


GGATGGCATG 


ACAGTAAGAG 


460 


AATTATG CAG 


TGCTGCCATA 


ACCATGAGTG 


ATAACACTGC 


GGCCAACTTA 


CTTCTGACAA 


540 


CGATCGGAGG 


ACCGAAGGAG 


CTAACCG CTT 


TTTTGCACAA 


CATGGGGGAT 


CATGTAACTC 


600 


GCCTTGATCG 


TTGGGAACCG 


GAGCTGAATG 


AAGCCATACC 


AAACGACGAG 


C GTGACACC A 


660 


CGATGCCTGT 


AGCAATGGCA 


ACAACGTTGC 


GCAAACTATT 


AACTGGCGAA 


CTACTTACTC 


720 


TAGCTTC CCG 


GCAACAATTA 


ATAGACTGGA 


TGGAGGCGGA 


TAAAGTTGCA 


GGAC CACTTC 


780 


TGCGCTCGGC 


CCTTCCGGCT 


GGCTGGTTTA 


TTGCTGATAA 


ATCTGGAGCC 


GGTGAGCGTG 


840 


GGTCTCGCGG 


TATCATTGCA 


GCACTGGGGC 


CAGATGGTAA 


GCCCTCCCGT 


ATCGTAGTTA 


900 


TCTACACGAC 


GGGGAGTCAG 


GCAACTATGG 


ATGAACGAAA 


TAGACAGATC 


GCTGAGATAG 


960 


GTGCCTCACT 


GATTAAGCAT 


TGGTAACTGT 


CAGACCAAGT 


TTACTCATAT 


ATACTTTAGA 


1020 


TTGATTTAAA 


ACTTCATTTT 


TAATTTAAAA 


GGATCTAGGT 


GAAGATC CTT 


TTTGATAATC 


1080 


TCATGACCAA 


AATCCCTTAA 


CGTGAGTTTT 


CGTTCCACTG 


AGCGTCAGAC 


CCCGTAGAAA 


114 0 


AGATCAAAGG 


ATCTTCTTGA 


GATCCTTTTT 


TTCTGCGCGT 


AATCTGCTGC 


TTGCAAACAA 


1200 


AAAAACCACC 


GCTACCAGCG 


GTGGTTTGTT 


TGCCGGATCA 


AGAGCTACCA 


ACTCTTTTTC 


1260 


CGAAGGTAAC 


TGGCTTCAGC 


AGAGCGCAGA 


TACCAAATAC 


TGTCCTTCTA' 


GTGTAGCCGT 


1320 


AGTTAGGCCA 


CCACTTCAAG 


AACTCTGTAG 


CACCGCCTAC 


ATACCTCGCT 


CTGCTAATCC 


1380 
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TGTTACCAGT GGCTGCTGCC AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC 144 0 

GATAGTTACC GGATAAGGCG CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA" 1500 
GCTTGGAGCG AACGACCTAC AC CGAACTG A GATACCTACA GCGTGAGCTA TGAGAAAGCG 1560 
CCACGCTTCC CGAAGGGAGA AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG 1620 
GAGAGCGCAC GAGGGAGCTT CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT 1680 
TTCGCCACCT CTGACTTGAG CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT 174 0 
GGAAAAACGC CAGCAACGCG GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC 
ACATGTTCTT TCCTGCGTTA TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT 
GAGCTGATAC CGCTCGCCGC AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG 
CGGAAGAGCG CCCAATACGC AAACCGCCTC TCCCCGCGCG TTGGCCGATT CATTAATGCA 
GCTGGCACGA CAGGTTTCCC GACTGGAAAG CGGGCAGTGA GCGCAACGCA ATTAATGTGA 
GTTAGCTCAC TCATTAGGCA CCCCAGGCTT TACACTTTAT GCTTCCGGCT CGTATGTTGT 
GTGGAATTGT GAGCGGATAA CAATTTCACA CAGGAAACAG CTATGACCAT GATTACGCCA 
AGCTCGAAAT TAACCCTCAC TAAAGGGAAC AAAAG CTGGA GCTCCACCGC ACTAGTATCG 
AGGTCTCGAT CCGGATATAG TTCCTCCTTT CAGCAAAAAA CCCCTCAAGA CCCGTTTAGA 
GGCCCCAAGG GGTTATGCTA GTTATTGCTC AGCGGTGGCA GCAGCCAACT CAGCTTCCTT 

TCGGGCTTTG TTAGCAGCCC CGCGGGGGCC CCTCCCTTAG CCATCCGAGT GGACGACGTC 

. - ■* . ■»■ 

CTCCTTCGGA TGCCCAGGTC GGACCGCGAG GAGGTGGAGA TGCCATGCCG ACc^/t^A/wr 

4 

ACCACAAAAC CAGATAAAAA ATAAAAACCA CAAGAGGGTC TTAAGGATCA AAGTTTTTTT 
CATACTTAAA GTTTGGAGTC TCCTCATGAT TTTTTAATCT CTCCAAGAGT TTTCCTCGTG 
TAGGTCACTC TTCAACATCA GTATAGACCG GTCTTCTTTT GAAATTCGTC TATTGATCCA 
TTCAATCATT CCTGTGTTTC TTCGCAAATT TGACCATTTC AAATGATTAT CCACTGTACG 
ACATAGCTGA TTGAACAAGA TCTCATTGAA TGGATTTAGA CGAACTTGGT TTCGGACCAA 
TTCCAGAGAT CTGATCCAGT TCCCGATTGG GGCCAAGGAG TCTGAAGTTC GGGTATCTTT 
TGGGAGCCCA TCACCTCTAG TACTCCACTT CTGCTTGTAT CCTCCTTTTA CTGAAACAGC 
CTCCCACCTA ATCGGGAATG ATTGCTGGAT AACTGCTAAA CACTGTTGAT ATAGTGGAAT 
GTCTTTCTCC ATCAAACTCA GCCAAAAGCT TATACCAGTT ATAGCGATCC CCACATTTTG 
TG CAATTCCA TCTGATGGGG GGTTCGGAGG TATCGGTCCT ACTCTGATAT GATTGATGTT 
ATAATACGAT ATAATCGCCA TATAAAAAAG GCTAATGGTC AATAAATCTG CAGGTCTATC 
AGATGATTTT AAGGCAGCCG CATGAGACAC ACCCGTGGGT ACTCCGAATA TTTGTAGCAT 
AGTCTCAATG TTTACAAAAG GATCAGGAAT GAATTGGGAG GGAATACCTG TCAAGGTAAA 
GTATGTACTA ACCTTCTTTG CTCTGGCAAA TTCCTGTTCT GATGACTGGA ATGCGTACAG 
GTTTTTCCAG GATTCATTGA TGGAAGACCA ATCGGGATTG GGTTCATCGA TTAATTTCTT 
CAAACCTTTA CATACCATAT ATACTTCAGA CGTTTGAGAA CTACTAAATT CTGTTTGAAC • 



1800 

I860 

1920 

1980 

2040 

2100 

2160 

2220 

22B0 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 
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TAAGTCGACC 


GTCTTGAACA 


TGGGACCAAG 


GATTGTTACT 


GCATTCTTTT 


CGCTCTCACA 


3480 


AATATATGTT 


CCATAAGTCT 


TGTAGATTAA 


AACTCCTTGC 


TCATCCAAAA 


TCCGGTGCAC 


3540 


ATAATTTCTA 


ACATTCGTCT 


CAATTTTCAG 


GCTAGTAGAA 


GAATCCCGAA 


CTTCCATATC 


3600 


CATTACAATT 


AAATCAATTT 


GAAGCCCCAA 


GCCTGCTTTG 


AGTCGGAGGA 


AATAGTCCCA 


3660 


AGTCCTTGGG 


TCACATAAGT 


CAGATGGATA 


TTCCCAACAT 


GTTTCACCAT 


TTACACATCT 


3720 


CGATTTATCT 


CCTCCTAAAG 


TTTCTAGGGC 


ACTGGGGGGC 


TCAGGAGAGG 


CGCCTCGCAT 


3780 


GACTGACCCT 


GATAATTCTA 


ACAGACTATT 


GAATATTCCT 


CTGCTATGCA 


CATTTTCTCG 


3840 


TAGTAATGCA 


GCAGTCATCC 


CTCCGGAGCC 


GTCTCCACAA CTCAAGAAGT 


CCCTGTAATG 


3900 


GATTCCCATT 


CCATGTAATA 


TACTCCGAAT 


TTTATAATGA 


GCGCCAGTTG 


GTAATTGGCC 


3960 


CAACCTGATT 


CCGGACAGCA 


GGGGATTTTG 


GATTCTTGGA 


GGCATCTCTA 


GCATCTTTGG 


4020 


GTAAGGGGTG 


GTCGTATAAT 


AAACAGGGAT 


TGTTGTAATT 


GTCCCTCTGG 


ATTCCCTTCC 


4080 


CCAAGGGGGA 


TAGCTCATGT 


CTTTATTATT 


ATCCTTAGCA 


ATCCCGAACT 


TGCAAGCATG 


4140 


TCTGATTTCC 


TCTGGACACA 


ATAATATGTC 


CTTGGTGAAG 


AATTTCACAT 


GTATGTCTTC 


4200 


CCACCCCTCT 


CCTGATCTTA 


GCAATGAAGA 


AAGATTTGCC 


AGCTCTCTCA 


ACTCATTCTT 


4260 


ATCTTTCCCA 


GATAAAAATG 


GCTTGTATAG 


GATTTGCAAG 


AGGGTGGTGG 


AAATAGAGAA 


4320 


TGGTCCAATG 


AAGTCTATGG 


ATAAGACATC 


TGAGAATAAC 


CATAATTGTG 


AATAATGTGA 


4380 


TCTGTATTTT 


CCCTTTTCAA 


TTAGACGGCA 


TTGGTATTTG 


AAGTAATTTC 


TGACAATCAC 


4440 


CCCCATATCA 


CGGTTGCTTG 


TCGGATAGGA 


GGTTGGGATC 


TTGTGGGGAA TCGTTTCTAA 


4500 


TTCGTCTCTA 


ATAGGTCCTG 


AT CTAGTAAG 


AGAAAGGAAT 


GGAGGTGATA 


CACTCAATTT 


4560 


ATCAATCAAG 


TAAATCAAAC 


CTCCGTACAC 


TGCGTTGGCC 


GGCCTCTTCA 


AATGAGCCAG 


4620 


ACTTCTCCGG 


TGTATTACTT 


GGCAGCAACT 


TGCTCTCATT 


AATCCGTCTA 


GCAACCCTTT 


4680 


TAAGAAACCT 


CGACCTCTAA 


TACGACCTTG 


TATAGATAGA 


GGAAATAGAG 


AACTGTCCTC 


4740 


GGCATGAGTA 


GATTTTCTAT 


ACGCCAAGTC 


T C CATATAG A 


AAACCTATAC 


ATCTGCCGAC 


4800 


TTGATAGGAT 


TGCTCAGCAG 


GTGCTAAATT 


CTTCCAATTC 


CCTTCTAAAG 


GATAGATCTG 


4860 


TTTTATCTCT 


TGTCCCCACG 


AACCTTCCCC 


ATTCCTCCAT 


GTCTTCAGCA 


CATGGGATAC 


4920 


ATCTGGGGGC 


GTGTAGTCCA . 


TACTTGAGTC 


CAGGG TGATC 


TCTTCTATGG 


GTCTCAAACA 


4980 


GGACTTACAG 


GCAATATGAT 


AATGATCTGT 


ACAACTGGTG 


ATCCATCCGT 


CTCTTGCAAC 


5040 


AGTGGTGGTA 


ATTTGAGCAT 


AGAGCAACGT 


TGCTTGGAAT 


AAAAAGTCGA 


AATTCTGATC 


5100 


TCCCAGATCC 


CTCATGGTGT 


CTGTAGTTGC 


CATCAACCTG 


GTCAATGCTG 


CAGTGCTCTG 


5160 


AGATGCGAAC 


CCACCATGGC 


TCATCCGAGA 


TGTCGAAAAC 


CTATGAAGGG 


CAGACCCTGT 


5220 


TCTTTTGAAC 


CCATGCTGCC 


TTTTGGTCCA 


TTCTTCGCCT 


GTTAAAGAGT 


GGATGTTAGA 


5280 


AAGTATAGTC 


ATTGCTAGTT 


TAGAGTCGGG 


TTCAACAAAC 


CAAGAGATAG 


CATCTCTAAG 


5340 


ACGTGTAGCT 


CTTTTAATCA 


GTGGGACTTT 


GCTTTCCCTT 


TCCCAAGGCT 


GCAAAATAGA 


5400 


TGTAGATTCA 


GATGTTTTAG 


ACCCTAGATA 


AGCAGGCAAT 


GGTCCCCGTG 


AACTAAAGAC 


5460 
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GTCATGGATC CCGTCTGGAC AATGCACAGA AACATAATTG AACCCTGATG TGTTACATGG 552 0 

TGCACAAGGA GTCTCTTTTC GATGTTGTGG ACCC AACATT TCTAATGGAT GGGGTACAGT 558 0 

TGTCCCAATA ACTGTACGGC CCCAGGATTT GTATCTTAAT GTGTCAGCAT GAGTAGCTGA 5640 

AC ATGTC CAC ATTTTACATG ATCCCCTTCT CAAATGAAGT TTCCCTAAAT GTGTCAAAGA 57 00 

GGATACCTCA CTCCTCACAA TCAAATCATC CAATTCCCTA TGATACTTTT TCTTAAAGGA 5760 

GTTCCGAATA GTACGAGAAT TTTGAAATAG ACTGATGAGC CCGTCTGCGA CTCCCAAAAA 5820 

AGTGCCTGAT TTGAATTCAC TTAAAAATCT AGGGAACAGA GGATTTATTG ACCATAAGAA 5880 

ACTTCTGAGC CGATCCTCTT CATGATACAA ATATATGGTT GCATCCTTAA TCACCTGGTT 594 0 

CCTGATGGTT TGTCTTGATT CGATTAAG C A TTTTTTAACC TCAGTCTTTA ACAAGTTCGC 6 00 0 

TGGACTCATT CCCATAGCGA TGTTCAGAGA GGTTGGATCT TCTACTAGCT TGTCTATGTG 6 06 0 

AGTTATTCGA AAGTTGGCTA TCTCGGGGTT TCCAAATACT GCACTCATCT CCTTCAGATG 6120 

CTCACTTCGA GCATGTACAT GGATGAATCT CCAGAATGAG AGACTTTCTG TTACGGGATC 618 0 

TGGGAAGGCT CTAATCAAAA ACCTGGACAA AGACATGCCC GACACTCCTC CAATGGAAGG 624 0 

GTCCAAATAC AACATGGCGT ATTTGAAAGT AGAACTGTGC AAGCCCGGTA TCTTATCTTG 6300 

AACTTCATAC AATGATTGAC GAAGAGCAGG ATCATG CATC ATCAACAAGA GTCTAGCAAA 6360 

TGTCCCAAAA TAATTGTACT GTATCATGGC ATTGATTGGG TTCTCAGCAA AATG AG CTAC 642 0 

GGTGAG AG CA TTTGTGGAAA CTGAGCTCAT TATATTAGCA CAAGTGGGTA TTTGGTCATT 6480 

GGTGACACAA GTCACTCGTG ACCATCTCTT GGTCTCTAAC CCTCTAATCA CTCuACi^GAA : 6 54 0 

AATCGG T ATT TTTCCATAAT TCAAGTAATC TGCAGATTGC ATAGTCTCAT CGTCATTTAT 6600 

CAAAAGTCCT AACTTCCCTG TCCCTATTTT GATTGCAGTC ATAATTTTCT CATTATTAGA 666 0 

AACCATTTGA TTGAGAGCAC CCTGTAATTC TACAACGTTT CTCGATTTCT TCGTTTTATA 672 0 

CTGTGTGCAA ATAACTTGAT TATCACCTTG TGCCAAGACT TTGACAGCAG TGTTTCTGAT 6780 

TTTAGCCTCT CTTTGAATAA CCAGTAGATT GAGGATAGTC CATCCTTTTT GCCGTAGACC 684 0 

TTCCAGTCCA CCCTCTTGTC CTTGCCAACA AACTCGTTGG GAGGTTGAAT TGATCAGTGT 6900 

GTTGTTGTGA ACACGCATCA AGTCTGGTCT TCCATTGTAG TATATAAGAC TTTTCTCAAA 696 0 

AAATTCATGA GTTCTCTCGA TTAAGGATGG ATAAC CTAAG AACTGGCCCA TAACTCGGAA 702 0 

CACTGGGCCG TTTGATAACT TCCTTTGGTG GTTATTCCAT TTTTCGTAAT CAATGTGATT 708 0 

GGCTATGCAA ATTGCCTCAT ATGACTTCAA TCCTTGGCCG GATGAGGAAT CTAACATCTT 714 0 

TTTAATGACT GCAGTTAGAT CGTCCGCCAT TGTCAGGCCT TTAAACATAG GGACGAAATG 7200 

AGTCTTTATC AAATATTCGG TAATTACAAA GTATTCTCGC AATTTCCAAG ACATTAGGGA 7260 

GAAAAATCTA CCTGCCAACT TCAGTTCCCT CTCCTTTCCT TTAAGACCAA TAATTAGATC 732 0 

ATCATCATCT AAGCCCTTCT CATCAATCTC TTTAAGAAAT TCTTTCCAAT TGGTAGCCTT 7380 

TGTGTCCAAC ATAGTCTGCA ACACCTTTTT ACTAGGGATA GGAGTGTTCG GATTCATTCG 744 0 

GACATGTTTC AACACCTCTG ACCTATTCAT TGAATGACTT TTGTCAGAGT ATATTATCGA 750 0 
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TGGGTCTAGT 


AAGTCGGGTA 


TTTCAAAACA 


TTTAATCAGC 


GGAAGTTCAT 


GCCATTTATC 


7560 


TCCAAAATCT 


TGAACTTGAG 


CAGCTGTGGG 


CCATGTATTT 


TCTTTAACAT 


GACTTTTAAA 


7620 


GGGATGATCA 


TGAGGGAGCA 


AGTCTCCATT 


CACGAACCAC 


TTTTTATGAT 


CATTGAACTG 


7680 


TTGAAATAGA 


ACAATCCGAG 


CTAAAT CACT 


TGCAAGTGCT 


TTTGCATATG 


ACACATCAAT 


7740 


ATCTTTCTTC 


ATGGTTACTT 


GGGAATGTAA 


TTTTTCTAGT 


CCAGTGTAAT 


AATCTATAAA 


7800 


AGGATGACCC 


CAATGTCTGA 


ACG AT C CAT A 


AATCACCAGT 


GTGAGATCCA 


CTGTTTTCAC 


7860 


ACTCATTATC 


TGATCATGGA 


GGAATCTTAT 


ACCTCGGTCA 


ATTTTTGCCC 


CTTCATCAAC . 


7920 


AGAAGTCTTG 


ATATGATTTT 


CAAAATGAGG 


GAATTGTGGG 


ACTAAAGGCC 


TTGATTCTCT 


7980 


TGCTAATTTC 


ATCAGCTTCA 


AGTTGCATAT 


CGGTTCCACC 


ATTTTAATCA 


AGTCATAAGA 


8040 


AAAATTTCCC 


TGCCTCTCCA 


CAATTTTATC 


TCCAATTCTG 


TAGATATTTA 


GAAGGGAGAA 


8100 


GATGTCTTGC 


TCTGAGAACA 


GGTTGTCTAT 


TCTACATACC 


ATGGATAGCA 


CCGTTTGCAT 


8160 


CCTCCCTATA 


ATCACATCTT 


TGACCATTAA 


CAGAAAGTTT 


CGGTCCATTA 


GAAT AT CAA.G 


8220 


TTTCTTGAAG 


TAAGCCCATC 


CTTCTGAAAT 


AAAAGTAGGA 


CCCAAGCTGG 


GAACCCTAAT 


8280 


CCTGCATATG 


TTCGTTCCAT 


GAGAACTTCT 


TCTGACTTTG 


CCTTTGAAAG 


TCCTCGCCAA 


8340 


GTTGAGCAAT 


TCCACCTCAG 


AGACAGCATT 


TAAGATTAAT 


GTCAACTTGT 


GTAAGTCCAA 


8400 


AAACTTTTGA 


CACAAATAAG 


CGAGAATTTT 


GAATGAGTCA 


GTCCATCTTT 


CCTTTTTGAT 


8460 


GTATTCAATT 


GGTTTGTTGC 

• - 7 

TTGTCCACrT 


CCCAGCCGCG 


GATGAAGGTC 


TCCACCACGT 


CAAATGTTAT 


8520 


TTCTGCCTCT 


CATGTAAAAA 


ACTATACCCT 


TG ACTGG CAT 


CATGATTATC 


8580 


AGACATTAAC 


CAACTTCCCA 


TCCATTTATG 


CATCTGAGAT 


GTTGAGATGG 


GATTGGCTTG 


8640 


ACATGATGTT 


AACATCTCAA 


GAACTCCATC 


CCAGTTCTTA 


CTATCCCACA 


TCGAGGGAAT 


8700 


CGGAAGAGAA 


TTGAATTTCC 


TGATCAAATT 


GTCAATATCA 


TCACTAATTA 


GAGGAGAATT 


8760 


CAAATTGTAA 


TCAGCATGAT 


TCAAGTACGT 


CATGCGCTCA 


TCGGGATTCA 


GGAATTCTCT 


8820 


TGTGGCATAG 


TCATCTTCAT 


TGAAATCATT 


GAACTCGTCG 


GTCTCAAAAT 


CGTGGACTTC 


B880 


CATGATTGCT 


GTTAGTTTTT 


TTCATAAAAA 


TTAAAAACTC 


AAATATAATT 


GAGGCCTCTT 


B940 


TGAGCATGGT 


ATCACAAGTT 


GATTTGGTCC 


AAACATGAAG 


AATCTGGCTA 


GCAGGATTTG 


9000 


AGTTACTTTC 


CAAGTCGGTT 


CATCTCTATG 


T CTGTATAAA 


TCTGTCTTTT 


CTTGGTGTGC 


9060 


TTTAATTTAA 


TGCAAAGATG 


GATACCAACT 


CGGAGAACCA 


AGAATAGTCC 


AATGATTAAC 


9120 


CCTATGATAA 


AGAAAAAAGA 


GGCAATAGAG 


CTTTTCCAAC 


TACTGAACCA 


ACCTTCTACA 


9180 


AGCTCGATTG 


GATTTTTGGA 


TAGCCCAGTA 


T C AC CAAAAA 


ATAAACTCTC 


ATCATCAGGA 


9240 


AGTTGCGAAG 


CAGCGTCTTG 


AATGTGAGGA 


TGTTCGAACA 


CCTGAGCCTT 


TGAG CTAAGA 


9300 


TGAAGATCGG 


AGTCCAACAT 


ACCATGTCCA 


ATCATGTATA 


AAGGAAACTT 


ATATCCTGAA 


9360 


CTGGTCCTCA 


GAACTCCATT 


GGGTCCAATT 


TCCACGTCTT 


CATATGGTGC 


CCAGTCATCC 


9420 


CACAGTTCCC 


TTTCTGTGGT 


AGTTCCACTG 


ATCATTCCGA 


CCATTCTTGA 


GAGGATTGGA 


9480 


GCAGCAATAT 


CGACTCTGAT 


GTATCTGGTC 


TCAAAGTATT 


TTAGGGTACC 


ATTGATTATG 


9540 
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GTGAAAGCAG GACCGG TTCC TGGGTTTTTA GGAGCAAGAT AGCTGAGATC CACTGGAGAG 96 00 

ATTGGAAGAC CCGCTCTGAT TTTGCTCCAG GTTTCTTGGC AGAGGGAATA ATCCAAGATC 966 0 

CTCTCAACGT CCTGAATTAG ACTTACATCC ACTGAGGTCT GAGATGGAGC AGAGATACTT 9720 

GACCCTTCTG GGCATTCAGG GAATCTGGCT GCAGCAAAGA GATCCTTATC AGCCATCTCG 978 0 

AACCAGACAC CTGATGGGAG TCTGACTCCC CAATGCTTGC AGTATTGCAT TTTGCAGGCC 984 0 

TTGCCTCCAG TTTCATAAGC AAAGTAGTTA CTTCTGAACC CTGTGCCCTC CTTTCCCAGG 9900 

GATGATAGCT CTCCGTCCTC TGAGAAGAAG GTGATGTCCA TGGAAATGAG GTTAGAATCA 99S0 

CATAGCCCTT TGACCTTATA GTCAGAATGC CAGGTTGTAG AGTTATGGAC AGTGGGGCAT 10020 

ATGTAATTGC TGCATTTTCC GTTGATGAAC TGTGAATCAA CCCATTCTCC TGTGTATTCA 10 080 

TCAACCAGCA CATGGTGAGG AGTCACCTGG ACAATCACTG CTTCGGCATC CGTCACAGTT 1014 0 

GCATATCCAC AACTTTGAGG AGGGAAGCCT GGATTCAGCC AAGTTCCTTG TTTCGTTTGT 10200 

TCAATGCTTT CCTTGCATTG TTCTACAGAT GGAGTGAAGG ATCGGATGGA CTGTGTTATA 10260 

TACTTCGGTC CATACCAGCG GAAATCACAA GTAGTGACCC ATTTGGAAGC ATGACACATC 10320 

CAACCGTCTG CTTG AAT AG C CTTGTGACTC TTGGGCATTT TGACTTGTAT GGCTGTGCCT 10380 

ATTAAGTCAT TATGC CAATT TAAAT CTGAG CTTGACGGGC AATAATGGTA ATTAGAAGGA 1044 0 

ACATTTTTCC AGTTTCCTTT TTGGTTGTGT GGAAAAACTA TGGTGAACTT GCAATTCACC 10500 

CCAATGAATA AAAAGG CTAA GTACAAAAGG CACTTCATAG TGACGGGTAA ACAGATCGAT 1056 0 

CTCTGTTAGT TTTTTTCATA GGGATAGAAA AGACAGGATA TTAGTTGTTC GaGAGGCTGG 10620 

AATTAGGAGA GACTGAGTAA ACCGGGGATT GTTCAGAAGC TAGAAGTTAG ACTAGCTCAT 10680 

TTGAAGTGGC TGATAGAATC CAGGACCCAC GCTCCAGATG CCTTTTTCTC GACAATCAGG 10740 

CCAAACATTA AGGCCTTCTC TCTGAAATCA. GAAAATTTGG AAGAATTGAA ATGATCCCAG 10800 

ATCATAGGAG CTGCTTCCAG TGACTCATCA TCGTAGATGG TCATTGTGAG CTCAATCGTT 10860 

CCCTTGTAAA GACCTATATT GAATGGTCTT CTGAAGTGCT CTGGTACATT GAG CATGGG A 10920 

GGGGTCTTCC CCATCCTATG TGGCAAATAA GCCCTGCCTT CGCAGTGAGT GTGATACTCT 10980 

GGTTGACCTT GATCTGCCAA TACCG CTGG A GTGGCCTTTA GATTAGAAGA ACCCAAAAAA 1104 0 

GCCAAGATTT TGTAGAAGGG ACGTTTCCCT GCCATTCCGA TGTACATGTG ATC CCAATGG 11100 

GATACAGCGG CTGCCACATC TGAGTATGTT CTGAACGGAC GATTAGATCT AACCGTCATT 1116 0 

TTCACTGTAA AGAAGAATTT CTCATATCTT AATTGATTCG GATCATAGGT GTCCATCTCG 11220 

TCAACTCCAA AATAGGATTT GTCAATTGGA GCGCTCGGAG CATACTCCAT GCTAGTGTCC 112 8 0 

TCTTCATAAG GGGGTGGTGC GATCCCTAAT TTCTTAGATT TCTTACCTTT CCCCTTCAGA 11340 

CCGAGAATCT TCTTTAAGGA ACTCATGATG AATGGATTGG GATAACACTT AGATCGTGAT 114 00 

ATCTGTTACT TTTTTTCATA GTCTACAGAG AATATTTGAC TCTCGCCTGA TTGTACAACT 1146 0 

TTTTGTATCT CAGGCCGAGC AGGATGGCCT CTTTATGAGA CATTCGTCCG TCACCTCCGA 1152 0 

CAGAGATGAA CTCTCCTCTA GATGAGAACA ATTCATCCAA GGATATGGTG AGAGGCTGAa' 1158 0 
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11640 
11700 



12000 
12060 



GACTTGCTTT CTTGGGTTGG AAAGTCATGG ATGTCTTTGA GAGAGACCAA ACATCTGATA 
CTGCTTCTGA TTGGGACGGA TGTGTGTTCA TCACTGGAGT GACCTTATAT ACATCCGGAG 
TTATCTGGCG CTCCTTCATA ATGACCCCTT CTCCCGATGC TTCAAATGTG CACTCTGCCA 11760 
GATTCCAGTA TTTGGCACTT TGCACGACTG CTTTAATCGT CGAAAGCCAC TGGGATTTCT 11820 
GCTCTCCACT TAAACCCTCT GGCGATGTCA ACCGTAAGGT CTTTCCATGC TCGTCAGATT 11880 
CAAGCTCAGG CTGTTTCCAG TCCGAAGTAA ATACAACATC CACTTCCTCA TCTGCATAGT 1194 0 
CATCTAAAGG CCCCTGTATA AAGCCTTCAA CTTGCTCAGC TTCTGGATCT GGTGCATACA 
AACCTTGATT GTCTTCAATT TCTGGTTCAG ATTCTGTGTC AGAATCATCT GCTGCCTGAA 
AATAAGAGGG CTTAGTATGC TCTTCCACTC CATCCTCTTG GAACAACTCA TAATTGGACT ' 12120 
TTTCAGCTCG TTGTGCTTCG ATCTCATCTA TCTCTCCTAC CGCCTGATCC AGACGAGAAT 12180 
AGGACTTGAG ATACTCACGA ACTTTTGTGA GA7TATCCAT GATATCTGTT AGTTTTTTTC 12240 
ATATGTAGCA TAATATATAA TAGGTGATCT GAGAATTATA GGGTCATTTG TCAAATTCTG 123 00 
ACTTAGCATA CTTGCCAATT GTCTTCTCTC TTAGGCCTTG CAGTGACATG ACTGCTCTTT 12360 
TCGCATACTG CATCATATCA GGAGTCGGTT TTCTGTTTTG ATCTTCAAAC CATCCGAGCC 12420 
ATTCGACCAC ATCTCTGCCT TGTGGCGGTG CATTAGTCGT CAATCCTCCG GTACTATCAT 12480 
CTGGAGTGTA TTTGTTATCT CCAACACAAA ACTGTTGTGC CAAGTCGGCA GAGGATCCTA 12540 
CTGCATAAGC GTACAACAAA CCTGCTGTAG TAAGAGATGT ATACTCAATG TCATCAGGCT 12600 
GTCGGGCATT CCTTGCTCi'G GTGCATGVGA GCAGAAGAGC TGTCAATTGC CCCCAGAAGT 12660 
GGAAGGCAGG GTTTTTGACG GAAGAATATG GAGACTTAGA AGACAATCCA AAGTCGATCA 12720 
AATAAGGCAT GTATGAATCG GCCTTGTCAA TTTCTTGGCC TGGAAGCATC ATTTGGACCA 12780 
TTTCATCTGC AACTTCTCGG TTCAAGATCC AGGTCGTTAC ATCTTCTGTA GACATTCCGG 12S40 
TTATTTTGCA GAGGTGTCCA AATGTTGCCA ATGCAGCACA ATCTTTGAAT CTGGAAACAA 12900 
TAGTTCCGTA TCTGAACGAG GCACATTCAT GTTTTTTGAA CATGTGGAAG AACATGTCCA 12 960 
CTGCAGCGAC AATTTTTGTG TAATTACTGT CATTTCCCCA CACATCAAAA ATGTCACGAC 
CTTCTGGCAC AAGAGGTTCA AACTGTTCAT TGATCATTTT GCATTGATTT GTCAGCCCAT 
CCATGAGCTT TTTTCTGTAT TCAGGCATTT GTGTTCTGCC CACTCTGTAT AAGCCAAGTA 
GATACAAAGG CAACCATTTG TCATCTGCGC TGGTTCTGGA AGCATCCGAT ACTCCATCTG 
GAAGTACGCC GTCCAGGGCT TTCAAGGATA CAAGGTCAAA TATTCCGATT GTATCCCCTG 13260 
CTTTCCCGAT GTTTATTCCG AAACTTGACC AATCTTTATC CAACTTACCC CGGATGTCCT 13320 
TTAATGCTCC ATACAAGTAG CTGTTGACAT GTATGATTGA TACATTTCCG GATTTGAGGC 
CTTGGTAGAC ATATCCTCTT AGATCTGACA AACTTTTTGT AGTATTGATG TAAAGAGGAA 
TCTCCTTTGA TTTTCTGAAG TAATCTGCCG GGTATTCCAC TGGATCCTCA TTTGCAGGAA 
GTTTTGGAAC TATGACTGTG TTGTCAATGA TTCTCTTGAC TGTAACAGAC ATTTTGATTA 
CTGTTAAAGT TTCTCCTGAG CCTTTTAATG ATAATAATGG TTTGTTTGTC TTCGTCCCTA 



13020 
13080 
13140 
13200 



13380 
13440 
13500 
13560 
13620 
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TAGTGAGTCG TATTACAACT CGAGGGGGGG CCCGGTACCC AATTCGCCCT ATAGTGAGTC 
GTATTACAAT TCACTGGCCG TCGTTTTACA ACGTCGTGAC TGGGAAAACC CTGGCGTTAC 
CCAACTTAAT CGCCTTGCAG CACATCCCCC TTTCGCCAGC TGGCGTAATA GCGAAGAGGC 
CCGCACCGAT CGCCCTTCCC AACAGTTGCG CAGCCTGAAT GGCGAATGGG ACGCGCCCTG 
TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG CTACACTTGC 
CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC TTTCTCGCCA CGTTCGCCGG 
CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT CCCTTTAGGG TTCCGATTTA GTGCTTTACG 
GCACCTCGAC CCCAAAAAAC TTGATTAGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG 
ATAGACGGTT TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 
CCAAACTGGA ACAACACTCA ACCCTATCTC GGTCTATTCT TTTGATTTAT AAGGGATTTT 
GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA CAAAAATTTA ACGCGAATTT 
TAACAAAATA TTAACGCTTA CAATTTAGGT G 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8 
TTGTAATACG ACTCACTATA GGG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



13680 

13740 

13800 

13860 

13920 

13980 

14040 

14100 

14160 

14220 

14280 

14311 



23 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTGTAATACG ACTCACTATA GGG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ACGAAGACAA ACAAACCATT ATTATCATTA AAAGGCTCAG GAGAAACTTT 50 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii). MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCTGCTAAC AAAGCCCGAA AGGAAGCTGA GTTGGCTGCT GCCACCGCTG AGCAATAACT 60 
AGCATAACCC CTTGGGGCCT CTAAACGGGT CTTGAGGGGT TTTTTGCTGA AAGGAGGAAC 12 0 

TATATCCGGA TCGAGA 

136 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



83 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGGTCGGCAT GGCATCTCCA CCTCCTCGCG GTCCGACCTG GGCATCCGAA GGAGGACGTC 60 
GTCCACTCGG ATGGCTAAGG GAG 

(2) INFORMATION FOR SEQ ID NO: 13: „ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 0 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 13: 
TTGTAGAAGG TTGGTTCAGT AGTTGGAAAA GCTCTATTGC CTCTTTTTTC TTTATCATAG 
GGTTAATCAT TGGACTATTC TTGGTTCTCC GAGTTGGTAT CCATCTTTGC ATTAAATTAA 
AGCACACCAA GAAAAGACAG ATTTATACAG ACATAGAGAT GAACCGACTT GGAAAGTAAC 



60 
120 
180 
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TCAAATCCTG 


CTAGCTATGA 


AAAAAACTAA 


CAGATATCCA 


ACCCGGGAGC 


TAGTTGCGGC 


240 




CGCCTAGCAG 


ATTCTTCATG 


TTTGGACCAA ATCAACTTGT 


GATACCATGC 


TCAAAGAGGC 


300 




CTCAATTATA 


TTTGAGTTTT 


TAATTTTTAT 


GAAAAAAACT 


AACAGCAATC 


ATGGAAGTCC 


360 


M 


ACGATTTTGA 


GACCGACGAG 


TTCAATGATT 


TCAATGAAGA 


TGACTATGCC 


ACAAGAGAAT 


420 


m 


TCCTGAATCC 


CGATGAGCGC 


ATGACGTACT 


TGAATCATGC 


TGATTACAAT 


TTGAATTCTC 


480 


m 


CTCTAATTAG 


TGATGATATT 


GACAATTTGA 


TCAGGAAATT 


CAATTCTCTT 


CCGATTCCCT 


540 


* 


CGATGTGGGA 


TAG7AAGAAC 


TGGGATGGAG 


TTCTTGAGAT 


GTTAACATCA 


TGTCAAGCCA 


600 




ATCCCATCTC 


AACATCTCAG 


ATGCATAAAT 








630 




(2) INFORMATION FOR SEQ ID NO: 14 : 













(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



ATTTATGCAT 


CTGAGATGTT 


GAGATGGGAT 


TGGCTTGACA TGATGTTAAC ATCTCAAGAA 


• 60 


CTCCATCCCA 


GTTCTTACTA 


TCCCACATCG 


AGGGAATCGG AAGAGAATTG AATTTf.CPGA 


1 PC 


TCAAATTGTC 


AATATCATCA 


CTAATTAGAG 


GAGAATTCAA ATTGTAATCA GCATGATTCA 


180 


AGTACGTCAT 


GCGCTCATCG 


GGATTCAGGA ATTCTCTTGT GGCATAGTCA TCTTCATTGA 


240 


AATCATTGAA 


CTCGTCGGTC 


TCAAAATCGT 


GGACTTCCAT GATTGCTGTT AGTTTTTTTC 


300 


ATAAAAATTA AAAACTCAAA 


TATAATTGAG 


GCCTCTTTGA GCATGGTATC ACAAGTTGAT 


360 


TTGGTCCAAA 


CATGAAGAAT 


CTGCTAGGCG 


GCCGCAACTA GCTCCCGGGT TGGATATCTG 


420 


TTAGTTTTTT 


TCATAGCTAG 


CAGGATTTGA 


GTTACTTTCC AAGTCGGTTC ATCTCTATGT 


480 


CTGTATAAAT 


CTGTCTTTTC 


TTGGTGTGCT 


TTAATTTAAT GCAAAGATGG ATACCAACTC 


540 


GGAGAACCAA 


GAATAGTCCA 


ATGATTAACC 


CTATGATAAA GAAAAAAGAG GCAATAGAGC 


600 


TTTTCCAACT 


ACTGAACCAA 


CCTTCTACAA 




630 


(2) INFORMATION FOR SEQ ID NO: 15: 





(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
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UCAGGAGAAA C 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 

• (ix) FEATURE: 

(A) NAME /KEY : 5' Gppp 

(B) LOCATION: 1 



11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
AACAGUAAUC 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GAUUACUGUU AAAGUUUCUC CUGA 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

, (ii) MOLECULE TYPE: RNA 



24 



(ix) FEATURE: 

(A) NAME /KEY : polyA 

(B) LOCATION: 10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GCUACAUAUG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



10 
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(ix) FEATURE: 

(A) NAME/KEY: 5' Gppp 
• (B) LOCATION: 1 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 19: 
AACAGAUAUC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GAUAUCUGUU AGUUUUUUUC AUAUGUAGC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: R* rt 



(ix) FEATURE: 

(A) NAME /KEY: polyA 

(B) LOCATION: 10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

GUAGACUAUG 

10 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA ' 



(ix) FEATURE: 

(A) NAME /KEY: 5' Gppp 

(B) LOCATION: 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

AACAGAUAUC 

10 

(2) INFORMATION FOR SEQ ID NO: 23: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
GAUAUCUGUU ACUUUUUUUC AUAGUCUAC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



29 



(ix) FEATURE: 

(A) NAME /KEY : polyA 
<B) LOCATION: 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 
UAUCCCUAUG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



10 



(ix) FEATURE: 

(A) NAME /KEY : 5' Gppp 

(B) LOCATION: 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AACAGAGAUC 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



10 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GAUCUCUGUU AGUUUUUUUC AUAGGGAUA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(ix) FEATURE: 

(A) NAME/KEY: polyA 

(B) LOCATION: 10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AAUUUUUAUG 

(2) INFORMATION FOR SEQ ID NO: 28: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(ix) FEATURE: 

(A) NAME /KEY: 5' Gppp 

(B) LOCATION: 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
AACAG CAAUC 

(2) INFORMATION FOR SEQ ID NO; 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
GAUUGCUGUU AGUUUUUUUC AUAAAAAUU 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: RNA 



( ix ) FEATURE : 

(A) NAME/KEY: polyA 

(B) LOCATION: 10 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
UUUAAGUAUG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AGGAUCAAAG UUUUUUUCAU ACUUAAA 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CATTCAAGAC GCTGCTTCGC AACTTCC 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
CATGAATGTT AACATCTCAA GA 
(2) INFORMATION FOR SEQ ID NO: 34: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA 



(ix) FEATURE: s 

(A) NAME/KEY: miscellaneous feature 

(B) LOCATION: 11. .12 

(D) OTHER INFORMATION: Intergenic dinucleotide 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
GAUNNCUGUU ANUUUUUUUC AUA 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 
U^OGA^AAA A 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: RNA 



11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
UUUUUUUCAU A 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 
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TATGAAAAAA A 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

CCGGCTCGAG TTGTAATACG ACTCACTATA GGGACGAAGA CAAACAAACC ATTATTATC 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
GAACTCTCCT CTAGATGAGA AC 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
AGGTCGGACC GCGAGGAGGT GGAGATGCCA TGCCGACCCA CGAAGACCAC AAAACCAG 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

h 

4' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
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ATGTTGAAGA GTGACCTACA CG 

22 
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WHAT IS CLAIMED IS ; 

1. A recombinant replicable vesiculovirus, the 
genome of which comprises a foreign RNA sequence. 

5 

2. The vesiculovirus of claim 1 in which an RNA 
sequence complementary to said foreign RNA sequence encodes a 
peptide or protein that is expressed in a suitable host 
infected by the virus. 

10 

3. The vesiculovirus of claim 1 in which the 
peptide or protein is antigenic or immunogenic. 

4 . The vesiculovirus of claim 2 in which the 
15 peptide or protein displays the antigenicity or 

immunogenicity of an antigen of a pathogenic microorganism. 

5. The vesiculovirus of claim 4 in which the 
pathogenic microorganism is a virus . 

20 

6. The vesiculovirus of claim 4 in which the 
pathogenic microorganism is a bacterium. 

7. The vesiculovirus of claim 4 in which the 
25 pathogenic microorganism is a parasite. 

8. The vesiculovirus of claim 4 in which the 
pathogenic microorganism is a human pathogen. 

30 9. The vesiculovirus of claim 4 in which the 

pathogenic microorganism is a non-human pathogen. 

10. The vesiculovirus of claim 2 in which the 
peptide or protein displays the antigenicity or 
35 immunogenicity of a tumor specific or tumor associated 
antigen. 
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1.1. Plasmid P VSVFL( + ), as deposited with the ATCC 
and assigned accession number 97134. 

12. A nucleic acid comprising the DNA sequence of 
5 plasmid pVSVFL(+) as depicted in Figure 1 from nucleotide 
numbers 623-12088 (a portion of SEQ ID N0:l), in which a 
region nonessential for vesiculovirus replication has been 
inserted into or replaced by foreign DNA. 

10 13. A host cell containing the recombinant 

vesiculovirus of claim 1. 

14. A method of producing a recombinant 
vesiculovirus comprising culturing a cell containing (a) a 
IS recombinant nucleic acid that can be transcribed to produce 
an RNA molecule comprising a vesiculovirus antigenomic (+) 
RNA, in which a region of the RNA nonessential for 
replication of the vesiculovirus has been inserted into or 
rc ? -uc«i ^ a fo reign RNA sequence; <Kyy* ricond recombinant 
20 nucleic acid encoding a vesiculovirus N protein; (c) a third 
recombinant nucleic acid encoding a vesiculovirus L protein; 
and (d) a fourth recombinant nucleic acid encoding a 
vesiculovirus P protein; whereby the first recombinant 
nucleic acid is transcribed in the cell to produce said RNA 
25 molecule, and the N, L and P proteins are expressed in the 
cell, and a recombinant replicable vesiculovirus is produced 
that has a genome that is the complement of said antigenomic 
RNA comprising said foreign RNA seguence. 

30 15 • The method according to claim 14 in which the 

cell is a mammalian cell. 

16. The method according to claim 14 in which the 
first recombinant nucleic acid comprises a first promoter 
35 seguence which controls the expression of the RNA molecule in 
the cell; the second recombinant nucleic acid comprises a 
second promoter sequence. which controls the expression of the 
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N protein in the cell; the third recombinant nucleic acid 
comprises a third promoter sequence which controls the 
expression of the P protein in the cell; and the fourth 
recombinant nucleic acid comprises a fourth promoter sequence 
5 which controls the expression of the L protein in the cell. 

17 . The method according to claim 14 in which said 
RNA molecule further comprises a ribozyme sequence 
immediately downstream of said antigenomic ( + ) RNA, that 

■ 

10 cleaves at the 3' terminus of the antigenomic ( + ) RNA. 

18. The method according to claim 14 in which the 
first recombinant nucleic acid is a DNA plasmid vector 
comprising the following operatively linked components: 

15 (a) a promoter; 

(b) a DNA sequence that can be transcribed 
under the control of the promoter in the 
cell to produce said RNA molecule; and 

(c) a transcription termination signal. 

20 

19. The method according to claim 14 or 18 in 
which the second recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components : 

25 (a) a promoter which controls the expression 

of the N protein; 

(b) a translation initiation signal; 

(c) a DNA sequence encoding the N protein; 
and 

30 (d) a transcription termination signal; and 

in which the third recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components : 

(a) a promoter which controls the expression 
35 of the P protein; 

(b) a translation initiation signal; 
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(c) a DNA sequence encoding the P protein; 
and 

(d) a transcription termination signal; and 
and in which the fourth recombinant nucleic acid is a DNA 

5 plasmid vector comprising the following operatively linked 
components: 

(a) a promoter which controls the expression 
of the L protein; 

(b) a translation initiation signal; 

10 (c) a DNA sequence encoding the L protein; 

and 

(d) a transcription termination signal, 

20. The method according to claim 17 in which the 
15 first recombinant nucleic acid is a DNA plasmid vector 

comprising the following operatively linked components: 

(a) a promoter; 

(b) a DNA sequence that can be transcribed 
under the control of the p romoter in the 

20 ce H to produce said RNA molecule; and 

(c) a transcription termination signal. 

21. The method according to claim 19 in which the 
first recombinant nucleic acid, the second recombinant 

25 nucleic acid, the third recombinant nucleic acid, and the 
fourth recombinant nucleic acid, each further comprises a 
selectable marker. 

22. The method according to claim 14 in which the 
30 second, third and fourth recombinant nucleic acids form part 

of a single recombinant nucleic acid. 

23. The method according to claim 16 in which the 
first, second, third and fourth promoter sequences are RNA 

35 polymerase promoter sequences for the same RNA polymerase, 
and in which the cell also contains a cytoplasmic source of 
said RNA polymerase. 
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24. The method according to claim 2 3 in which the 
cytoplasmic source of said RNA polymerase is a recombinant 
vaccinia virus expressing said RNA polymerase in the cell. 

5 25. A method o^ producing a recombinant 

vesiculovirus comprising culturing a mammalian cell 
containing: 

(a) a first DNA plasmid vector comprising the 
following operatively linked components: 
10 (i) a bacteriophage RNA polymerase promoter; 

(ii) a first DNA sequence that is transcribed 
in the cell to produce an RNA molecule 
comprising (A) a vesiculovirus 
antigenomic (+) RNA in which a region of 
15 the RNA nonessential for replication of 

the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence, 
and (B) a ribozyme sequence immediately 
downstream of saiJ aiiLigertomic (+) RNA, 
20 that cleaves at the 3' terminus of the 

antigenomic RNA; and 
(iii) a transcription termination signal for 
the RNA polymerase; 

(b) a second DNA plasmid vector comprising the 
25 following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter ; 

(ii) a second DNA sequence encoding an N 
protein of the vesiculovirus; and 
30 (iii) a second transcription termination signal 

for the RNA polymerase; 

(c) a third DNA plasmid vector comprising the 
following operatively linked components: 

(i) the bacteriophage RNA polymerase 
35 ' promoter; 

(ii) a third DNA sequence encoding a P protein 
of the vesiculovirus; and 
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(iii) a third transcription termination signal 
for the RNA polymerase; 
(d) a fourth DNA plasmid vector comprising the 
following operatively linked components: 

5 s (i) the bacteriophage RNA polymerase 

promoter; 

(ii) a fourth DNA sequence encoding an L 
protein of the vesiculovirus; and 
(iii) a fourth transcription termination signal 
10 f or the RNA polymerase; and 

(e) a recombinant vaccinia virus comprising a 
sequence encoding the bacteriophage RNA polymerase; whereby 
in said cell the first DNA sequence is transcribed to produce 
said RNA molecule, the N , P, and L proteins and the 
15 bacteriophage RNA polymerase are expressed, and a recombinant 
replicable vesiculovirus is produced that has a genome that 
is the complement of said antigenomic RNA comprising said 
foreign RNA sequence. 



20 



25 



26. The method according to claim 25 in which the 
ribozyme sequence is the seguence of the hepatitis delta 
virus ribozyme, and the bacteriophage RNA polymerase is the 
T7 RNA polymerase. 

27. The method according to claim 14 in which the 
foreign RNA sequence is the complement of a sequence encoding 
a protein or peptide that is immunogenic or antigenic. 

28. The method according to claim 25 in which the 
30 foreign RNA sequence is the complement of a sequence encoding 

a protein or peptide that is immunogenic or antigenic. 

29. The method according to claim 27 or 28 in 
which the protein or peptide displays the antigenicity or 

35 immunogenicity of an antigen of a pathogenic microorganism. 
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30. The method according to claim 27 or 28 in 
which the protein or peptide displays the antigenicity or 
immunogenicity of a tumor specific antigen. 

i 

5 31, The method' according to claim 25 which further 

comprises before step (a) the step of introducing said first 
second, third and fourth plasmid vectors and said recombinant 
vaccinia virus into said cell. 

10 32. A host cell comprising (a) a recombinant 

nucleic acid that can be transcribed to produce an RNA 
molecule comprising a vesiculovirus antigenomic (+) rna in 
which a region of the RNA nonessential for replication of the 
vesiculovirus has been inserted into or replaced by a foreign 

15 RNA sequence;- (b) a second recombinant nucleic acid encoding 
a vesiculovirus N protein; (c) a third recombinant nucleic 
acid encoding a vesiculovirus L protein; and (d) a fourth 
recombinant nucleic acid encoding a vesiculovirus P protein. 

20 33. A host cell comprising: 

(a) a first DNA plasmid vector comprising the 
following operatively linked components: 

(i) a bacteriophage RNA polymerase promoter; 
(ii) a first DNA sequence that is transcribed 
in tlle cell to produce an RNA molecule 
comprising (A) a vesiculovirus 
antigenomic (+) RNA in which a region of 
the RNA nonessential for replication of 
the vesiculovirus has been inserted into 
or replaced by a foreign RNA seguence, 
and (B) a ribozyme seguence immediately 
downstream of said antigenomic (+) rna, 
that cleaves at the 3' terminus of the 
antigenomic RNA; and 

35 a transcription termination signal for 

the RNA polymerase; 
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(b) a second DNA plasmid vector comprising the 
following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter; 

5 a second DNA .sequence encoding an N 

protein of the vesiculovirus; and 
(iii) a second transcription termination signal 
for the RNA polymerase; 

(c) a third DNA plasmid vector comprising the 
10 following operatively linked components: 

(i) the bacteriophage RNA polymerase 
promoter ; 

(ii) a third DNA sequence encoding a p protein 
of the vesiculovirus; and 
15 a third transcription termination signal 

for the RNA polymerase; 

(d) a fourth DNA plasmid vector comprising the 
following operatively linked components: 

\±) the bacteriophage RNA pc\yr>*rase> 
20 t promoter; 

(ii) a fourth DNA sequence encoding an L 
protein of the vesiculovirus; and 
(iii) a fourth transcription termination signal 
for the RNA polymerase; and 
25 < e > a recombinant vaccinia virus comprising a 

sequence encoding the bacteriophage RNA polymerase; whereby 
in said cell the first DNA sequence is transcribed to produce 
said RNA molecule, the N, P, and L proteins and the 
bacteriophage RNA polymerase are expressed, and a recombinant 
30 replicable vesiculovirus is produced that has a genome that 
is the complement of said antigenomic RNA comprising said 
foreign RNA seguence. 

34. A vaccine formulation comprising an effective 
35 immunizing amount of the vesiculovirus of claim 3; and a 
pharmaceutical^ acceptable carrier. 
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35. A vaccine formulation comprising an effective 
immunizing amount of the recombinant vesiculovirus of claim 
4; and a pharmaceutically acceptable carrier. 

5 36. A vaccine formulation comprising an effective 

immunizing amount of the recombinant vesiculovirus of claim 
10; and a pharmaceutically acceptable carrier. 

37. A vaccine formulation comprising an effective 
10 immunizing amount of a first recombinant vesiculovirus of 

claim 3; an effective immunizing amount of a second 
recombinant vesiculovirus of claim 3; and a pharmaceutically 
acceptable carrier; in which said peptide or protein 
expressed by the first and second recombinant vesiculoviruses 
15 are different. 

38. The vaccine formulation of claim 37 in which 
J the peptide or protein expressed by the first recombinant 

vesiculovirus- displays the antigenicity or immunogenicity of 
20 an antigen of a first pathogenic microorganism, and the 
peptide or protein expressed by the second recombinant 
vesiculovirus displays the antigenicity or immunogenicity of 
an antigen of a second, different pathogenic microorganism. 

25 39. A method of treating or preventing a disease 

or disorder in a subject comprising administering to the 
subject an effective immunizing amount of the recombinant 
vesiculovirus of claim 3. 

30 4 0. A method of treating or preventing a disease 

or disorder in a subject caused by infection by a pathogenic 
microorganism comprising administering to the subject an 
effective immunizing amount of the recombinant vesiculovirus 
of claim 4. 

35 

41. A method of treating or preventing a subject 
having a tumor comprising administering to the subject an 
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effective immunizing amount of the recombinant vesiculovirus 
of claim 10. 

42. The method according to claim 39 in which the 
5 subject is a human. 

43. The method according to claim 39 in which the 
subject is a non-human animal. 

10 44. A kit comprising in one or more containers (a) 

a first recombinant nucleic acid that can be transcribed in a 
host cell to produce an RNA molecule comprising a 
vesiculovirus antigenomic ( + ) RNA, in which a region of the 
RNA nonessential for replication of the vesiculovirus has 

15 been inserted into or replaced by a foreign RNA sequence; (b) 
a second recombinant nucleic acid encoding and capable of 
expressing in the host cell a vesiculovirus N protein; (c) a 
third recombinant nucleic acid encoding and capable of 
expressing in the host cell a vesiculovirus- L proc^i,); and 

2 0 (d) a fourth recombinant nucleic acid encoding and capable of 
expressing in the host cell a vesiculovirus P protein. 

45. A kit comprising in one or more containers (a) 
a first recombinant nucleic acid that can be transcribed in a 

25 host cell to produce an RNA molecule comprising a 

vesiculovirus antigenomic ( + ) RNA; (b) a second recombinant 
nucleic acid encoding and capable of expressing in the host 
cell a vesiculovirus N protein; (c) a third recombinant 
nucleic acid encoding and capable of expressing in the host 

30 cell a vesiculovirus L protein; and (d) a fourth recombinant 
nucleic acid encoding and capable of expressing in the host 
cell a vesiculovirus P protein. 

46. A kit comprising in one or more containers (a) 
35 a recombinant nucleic acid that can be transcribed to produce 

an RNA molecule comprising a vesiculovirus antigenomic (+) 
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RNA; and (b) a host cell that recombinantly expresses 
vesiculovirus N, P, and L proteins. 

47. The kit of claim 45 in which said first 
5 recombinant nucleic acid further comprises a polylinker in a 
region that is transcribed to produce said RNA molecule, said 
region being nonessential for replication of the 
vesiculovirus . 

10 48. The kit according to claim 45 in which the 

second recombinant nucleic acid is a DNA plasmid vector 
comprising the following operatively linked components: 

(a) a promoter which controls the expression 
of the N protein; 
15 (b) a translation initiation signal; 

(c) a DNA sequence encoding the N protein; 
and 

(d) a transcription termination signal; and 
in which the third "recombinant nucleic acid is a DNA plasmid 

20 vector comprising the following operatively linked 
components : 

(a) a promoter which controls the expression 
of the P protein; 

(b) a translation initiation signal; 

25 (c) a DNA sequence encoding the P protein; 

and 

(d) a transcription termination signal 
and in which the fourth recombinant nucleic acid is a DNA 
plasmid vector comprising the following operatively linked 
30 components: 

(a) a promoter which controls' the expression 
of the L protein; 

(b) a translation initiation signal; 

(c) a DNA sequence encoding the L protein; 
35 and 

(d) a transcription termination signal; and 
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in which the first recombinant nucleic acid is a DNA plasmid 
vector comprising the following operatively linked 
components: 

(a) a promoter; 
5 (b) a DNA seguence that can be transcribed 

under the control of the promoter in the 
cell to produce said RNA molecule, said 
RNA molecule further comprising a 
ribozyme seguence immediately downstream 
10 of sa id antigenomic ( + ) RNA, that cleaves 

at the 3' terminus of the antigenomic (+) 
RNA, said DNA comprising a polylinker in 
a region that is transcribed to produce 
said RNA molecule, said region being 
15 nonessential for replication of the 

vesiculovirus; and 
■(c) a transcription termination signal. 

49. The kit of claim 48 in which the. ^o^oter is 
20 an RNA polymerase promoter, and in which the cell also 

contains a cytoplasmic source of said RNA polymerase. 

50. A kit comprising 

(a) in a first container a first DNA plasmid 
25 vector comprising the following operatively linked 
components : 

(i) a bacteriophage RNA polymerase promoter; 
(ii) a first DNA sequence that can be 

transcribed in a cell to produce an RNA 

30 molecule comprising (A) a vesiculovirus 

antigenomic (+) RNA in which a region of 
the RNA nonessential for replication of 
the vesiculovirus has been inserted into 
or replaced by a foreign RNA sequence, 

35 and (B) a ribozyme sequence immediately 

downstream of said antigenomic (+) RNA, 
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that cleaves at the 3' terminus of the 
antigenomic RNA; and 
(iii) a transcription termination signal for 
the RNA polymerase; 
5 (b) in a s second container a second DNA plasmid 

vector comprising the following operatively linked 
components : 

(i) the bacteriophage RNA polymerase 
promoter ; 

10 (ii) a second DNA sequence encoding an N 

protein of the vesiculovirus; and 
(iii) a second transcription termination signal 
for the RNA polymerase; 

(c) in a third container a third DNA plasmid 
15 vector comprising the following operatively linked 

components : 

(i) the bacteriophage RNA polymerase 
promoter; 

(ii) ;a third DNA sequence encoding a P protein 
2 0 of the vesiculovirus; and 

(iii) a third transcription termination signal 
for the RNA polymerase; 

(d) in a fourth container a fourth DNA plasmid 
vector comprising the following operatively linked 

25 components: 

(i) the bacteriophage RNA polymerase 
promoter ; 

(ii) a fourth DNA sequence encoding ah L 
protein of the vesiculovirus; and 
30 (iii) a fourth transcription termination signal 

for the RNA polymerase; and 

(e) in a fifth container a recombinant vaccinia 
virus comprising a sequence encoding the bacteriophage RNA 
polymerase. 

35 

51. A kit comprising in a container an effective 
immunizing amount of the vesiculovirus of claim 3. 
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52. The vesiculovirus of claim 1 which is a 
vesicular stomatitis virus. 

53. The vesiculovirus of claim 4 in which the 
5 peptide or protein displays the antigenicity or 

immunogenicity of an envelope glycoprotein of a virus other 
than a vesiculovirus . 

54. The vesiculovirus of claim 53 in which the 

10 envelope glycoprotein is an envelope glycoprotein of a Human 
Immunodeficiency Virus. 

55. The vesiculovirus of claim 53 in which the 
peptide or protein is incorporated into the vesiculovirus 

15 envelope. 

56. The vesiculovirus of claim 53 in which the 
peptide or protein is expressed as a fusion protein 
comprising the cytoplasmic domain of a vesiculovirus « 

20 protein. 

57. The vesiculovirus of claim 56 in which the 
native G protein of the vesiculovirus is not expressed. 

25 58. The vesiculovirus of claim 56 in which the 

native G protein of the vesiculovirus is also expressed. 

59. The vesiculovirus of claim 3 in which a second 
RNA sequence complementary to said foreign RNA sequence 
30 encodes a second peptide or protein that is expressed in the 
suitable host, in which the first peptide or protein and the 
second peptide or protein display different antigenicity or 
immunogeni c i ty . 

35 60. An inactivated recombinant vesiculovirus that 

is the product of a method comprising inactivating a 
recombinant replicable vesiculovirus, the genome of which 
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comprises a foreign RNA sequence, in which an RNA sequence 
complementary to said foreign RNA sequence encodes a peptide 
or protein that is expressed in a suitable host infected by 
the recombinant replicable vesiculovirus, and in which the 
5 peptide or protein is antigenic or immunogenic. 

61. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 
immunogenicity of an antigen of a pathogenic microorganism. 

10 

62. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 
immunogenicity of a tumor specific or tumor associated 
antigen. 

15 

63. The vesiculovirus of claim 60 in which a 
second RNA sequence complementary to said foreign RNA 
sequence encodes a second peptide or protein that is 
expressed in the .£u-£^s£t ; e.~host , in which the first peptide or 

2 0 protein and the second peptide or protein display different 
antigenicity or immunogenicity. 

64. The vesiculovirus of claim 60 in which the 
peptide or protein displays the antigenicity or 

25 immunogenicity of an envelope glycoprotein of a virus other 
than a vesiculovirus. 

65. The vesiculovirus of claim 64 in which the 
envelope glycoprotein is an envelope glycoprotein of a Human 

30 Immunodeficiency Virus. 

66. The vesiculovirus of claim 64 in which the 
peptide or protein is incorporated into the vesiculovirus 
envelope . 

35 

67. The vesiculovirus of claim 64 in which the 
peptide or protein is expressed as a fusion protein 
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comprising the cytoplasmic domain of a vesiculovirus G 
protein. 

68. A vaccine formulation comprising an effective 
5 immunizing amount of the vesiculovirus of claim 60; and a 

pharmaceutical^ acceptable carrier. 

69. A vaccine formulation comprising an effective 
immunizing amount of the vesiculovirus of claim 64; and a 

10 pharmaceutical^ acceptable carrier. 

70. A vaccine formulation comprising an effective 
immunizing amount of the vesiculovirus of claim 66; and a 
pharmaceutically acceptable carrier. 

15 

71. A method of treating or preventing a disease 
or disorder in a subject comprising administering to the 
subject an effective immunizing amount of the inactivated 
recombinant vesiculovirus of claim 60. 

20 

72. A kit comprising in a container an effective 
immunizing amount of the inactivated recombinant 
vesiculovirus of claim 60. 

25 73. A method for producing a peptide or protein 

comprising culturing a cell containing the recombinant 
replicable vesiculovirus of claim 2 in said suitable host; 
and recovering the expressed peptide or protein. 

30 74. A method for producing an inactivated virus 

vaccine comprising inactivating a recombinant replicable 
vesiculovirus, the genome of which comprises a foreign RNA 
sequence, in which an RNA sequence complementary to said 
foreign RNA sequence encodes a peptide or protein that is 

35 expressed in a suitable host infected by the recombinant 

replicable vesiculovirus, and in which the peptide or protein 
is antigenic or immunogenic. 
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